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B&P FUe No. 10447-009/JRR 

Tide; IMPROVED RICIN-LIKE TOXINS FOR TREATMENT 

OF CANCER 

FTFXP OF THE INVENTION 
5 The invention relates to proteins useful as therapeutics 

agamst cancer. The proteins contain A and B chains of a ricin-like toxin 
linked by a novel linker sequence that is specifically cleaved and activated 
by proteases specific to cancer. 
BACKGROUND OF THE INVENTION 
10 Bacteria and plants are known to produce cytotoxic proteins 

which may consist of one, two or several polypeptides or suburuts. Those 
proteins having a single subunit may be loosely classified as T3^e I 
proteins. Many of the cytotoxins which have evolved two subunit 
structures are referred to as type 11 proteins (Saelinger, C.B. in Trafficking 
15 of Bacterial Toxins (eds. Saelinger, C.B.) 1-13 (CRC Press Inc., Boca Raton, 
Florida, 1990). One subunit, the A chain, possesses the toxic activity 
whereas the second subxmit, the B chain, binds cell surfaces and mediates 
entry of the toxin into a target cell. A subset of these toxins kill target cells 
by inhibiting protein biosynthesis. For^example, bacterial toxins such as 
20 diphtheria toxin or Pseudomonas exotoxin inhibit protein synthesis by 
inactivating elongation factor 2. Plant toxins such as ricin, abrin, and 
bacterial toxin Shiga toxin, inhibit protein synthesis by directly inactivating 
the ribosomes (Olsnes, S. & Phil, A. in Molecular action of toxins and 
viruses (eds. Cohen, P. & vanHeyningen, S.) 51-105 Elsevier Biomedical 
25 Press, Amsterdam, 1982). 

Ricin, derived from the seeds of Ricinus communis (castor oil 
plant), may be the most potent of the plant toxins. It is estimated that a 
single ricin A chain is able to inactivate ribosomes at a rate of 1500 rib 
osomes /minute. Consequently, a single molecule of ricin is enough to kill 
30 a cell (Olsnes, S. & Phil, A. in Molecular action of toxins and viruses (eds. 
Cohen, P. & vanHeyningen, S.) (Elsevier Biomedical Press, Amsterdam, 
1982). The ricin toxin is a glycosylated heterodimer consisting of A and B 
chains with molecular masses of 30,625. Da and 31,431 Da linked by a 
disxilphide bond. The A chain of ricin has an N-glycosidase activity and 
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catalyzes the excision of a specific adenine residue from the 28S rRNA of 
eukaryotic ribosomes (Endo, Y. & Tsurugi, K. J., Biol Chem, 262:8128 
(1987)). The B chain of ricin, although not toxic in itself, promotes the 
toxicity of the A chain by binding to galactose residues on the surface of 
5 eukaryotic cells and stimulating receptor-mediated endocy tosis of the toxin 
molecule (Simmons et al., Biol. Chem. 261:7912 (1986)). Once the toxin 
molecule consisting of the A and B chains is internalized into the cell via 
clathrin-dependent or independent mechanisms, the greater reduction 
potential within the cell induces a release of the active A chain, eliciting its 
10 inhibitory effect on protein synthesis and its cytotoxicity (Emmanuel, F. et 
aL, Anal. Biochem. 173: 134-141 (1988); Blum, J.S. et aL, /. Biol Chem. 266: 
22091-22095 (1991); Fiani, M.L. et al.. Arch, Biochem, Biophys. 307: 225-230 
"^^^ (1993)). Empirical evidence suggests that activated toxin (e.g. ridn, shiga 

4^ toxin and others) in the endosomes is transcytosed flirough the trans-Golgi 

^ 15 network to the endoplasmic reticulimi by retrograde transport before the 

3 A chain is translocated into the cytoplasm to elicit its action (Sandvig, K. & 

3 van Deurs, B., FEBS Lett. 346: 99-102 (1994). 

H Protein toxins are initially produced in an inactive, precxxrsor 

Q form. Ridn is initially produced as a single polypeptide (preproricin) with 

^ 20 an amino acid N-terminal presequence and 12 amino add linker between 

the A and B chains. The pre-sequence is removed during translocation of 
the ricin precursor into the endoplasmic reticulum (Lord, J.M., Eur. 
Biochem. 146:403-409 (1985) and Lord, J.M., Eur. J. Biochem. 146:411-416 
(1985)). The proridn is flien translocated into specialized organelles called 
25 protein bodies where a plant protease deaves the protein at a linker region 
between the A and B chains (Lord, J.M. et al., FASAB journal 8:201-208 
(1994)). The two chains, however, remain covalently attached by an 
interchain disulfide bond (cysteine 259 in the A chain to cysteine 4 in the B 
chain) and mature disulfide linked ricin is stored in protein bodies inside 
30 the plant cells. The A chain is inactive in proricin (O'Hare, M. et al., FEBS 
Lett. 273:200-204 (1990)) and it is inactive in the disulfide-linked mature 
ricin (Richardson, P.T. et al., FEBS Lett. 255:15-20 (1989)). The ribosomes of 
the castor bean plant are themselves susceptible to inactivation by ricin A 
chain; however, as there is no cell surface galactose to permit B chain 
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recognition the A chain cannot re-enter the cell. The exact mecharusm of A 
chain release and activation in target cell cytoplasm is not known (Lord, 
J.M. et al., FASAB journal 8:201-208 (1994)). However, it is known that for 
activation to take place the disulfide bond between the A and B chains 
5 must be reduced and, hence, the linkage between subtmits broken. 

Diphtheria toxin is produced by Corynebacterium diphtheriae 
as a 535 amino acid polypeptide with a molecvilar weight of approximately 
58kD (Greei\field, L. et aL, Proc, Natl Acad. Set USA 80:6853-6857 (1983); 
Pastan, L et aL, Annu. Rev, Biochem. 61:331-354 (1992); Collier, R.J. & 
10 Kandel, J., 1. Biol Chem. 246:1496-1503 (1971)). It is secreted as a single- 
chain polypeptide consisting of 2 functional domains. Similar to proricin, 
P the N-terminal domain (A-chain) contains the cytotoxic moiety whereas 

the C-terminal domain (B-chain) is responsible for binding to the cells and 
Jz facilitates toxin endocytosis. Conversely, the mechanism of cytotoxicity 

J 15 for diphtheria toxin is based on ADP-ribosylation of EF-2 thereby blocking 

s protein synthesis and producing cell death. The 2 fimctional domains in 

% diphtheria toxin are linked by an arginine-rich peptide sequence as well as 

H« a disulphide bond. Once the diphtheria toxin is internalized into the cell, 

Q the arginine-rich peptide linker is cleaved by trypsin-like enzymes and the 

Cl 20 disulphide bond (Cys 186-201) is reduced. The cytotoxic domain is 

subsequentiy translocated into the cytosol substantially as described above 
for ricin and elicits ribosomeil inhibition and cytotoxicity. 

Pseudomonas exotoxin is also a 66kD single-chain toxin 
protein secreted by Pseudomonas aeruginosa with a similar mechanism of 
25 cytotoxicity to that of diphtheria toxin (Pastan, I. et al., Annu. Rev, Biochem. 
61:331-354 (1992); Ogata, M. et aL, J. Biol Chem, 267:25396 25401 (1992); 
Vagil, M.L. et al.. Infect. Immunol. 16:353-361 (1977)). Pseudomonas exotoxin 
consists of 3 conjoint functional domains. The first domain la (amino acids 
1-252) is responsible for cell binding and toxin endocytosis, a second 
30 domain 11 (amino acids 253-364) is responsible for toxin translocation from 
the endocytic vesicle to the cytosol, and a third domain HI (amino acids 
400-613) is responsible for protein synthesis inhibition and cytotoxicity. 
After Pseudomonas exotoxin enters the cell, the liberation of the cytotoxic 
domain is effected by both proteolytic cleavage of a polypeptide sequence 
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in the second domain (near Arg 279) and the reduction of the disulphide 
bond (Cys 265-287) in the endocytic vesicles. In essence, the overall 
pathway to cytotoxicity is analogous to diphtheria toxin with the exception 
that the toxin translocation domain in Pseudomonas exotoxin is structurally 
distinct. 

Class 2 ribosomal iiihibitory proteins (RIP-2) constitute 
other toxins possessing distinct hinctional domains for cytotoxicity and cell 
binding /toxin translocation which include abrin, modeccin, volkensin^ 
(Sandvig, K. et al., Biochem, Soc. Trans, 21:707-711 (1993)) and mistle toe 
lectin (viscumin) (Olsnes, S. & Phil, A. in Molecular action of toxins and 
viruses (eds. Cohen, P. & vanHeyningen, S.) 51-105 Elsevier Biomedical 
Press, Amsterdam, 1982; Fodstad, et al. Cane. Res. 44: 862 (1984)). Some 
toxins such as Shiga toxin and cholera toxin also have multiple polypeptide 
chains responsible for receptor binding and endocytosis. 

The ricin gene has been cloned and sequenced, and the X- 
ray crystal structures of the A and B chains have been described (Rutenber, 
E. et al. Proteins 10:240-250 (1991); Weston et al., Mol. Bio, 244:410-422,1994; 
Lamb and Lord, Eur. }. Biochem, 14:265 (1985); Hailing, K. et al. Nucleic Acids 
Res, 13:8019 (1985)). Similarly, the genes for diptheria toxin and 
Pseudomonas exotoxin have been cloned and sequenced, and the 3- 
dimensional structures of the toxin proteins have been elucidated and 
described (Columblatti, M. et al., /, Biol Chem. 261:3030-3035 (1986); 
Allured, V.S. et al., Proc. Natl. Acad. Sci. USA 83:1320-1324 (1986); Gray, 
G.L. et al., Proc. Natl. Acad. Sci. USA 81:2645-2649 (1984); Greenfield, L. et 
al., Proc, Natl. Acad. Sci. USA 80:6853-6857 (1983); Collier, R.J. et al., J. BioL 
Chem. 257:5283-5285 (1982)). 

The potential of bacterial and plant toxins for inhibiting 
mammalian retroviruses, particularly acquired immunodeficiency 
syndrome (AIDS), has been investigated. Bacterial toxins such as 
Pseudomonas exotoxin-A and subunit A of diphtheria toxin; dual chain 
ribosomal inhibitory plant toxins such as ricin, and single chain ribosomal 
inhibitory -proteii\s such as trichosanthin and pokeweed antiviral protein 
have been used for the elimination of HIV infected cells (Okon et al., AIDS 
Res, and Human Retroviruses 7:1025-1030 (1991)). The high toxicity of these 
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toxins for mammalian cells, combined with a lack of specificity of action 
poses a major problem to the development of pharmaceuticals 
incorporating the toxins, such as immvmotoxins. 

Due to their extreme toxicity there has been much interest 
5 in making ricin-based immunotoxins as therapeutic agents for specifically 
destroying or inhibiting infected or tumourous cells or tissues (Vitetta et 
al.. Science 238:1098-1104(1987)). An immunotoxin is a conjugate of a 
specific cell binding component, such as a monoclonal antibody or growth 
factor and the toxin in which the two protein components are covalently 
10 linked. Generally, the components are chemically coupled. However, the 
linkage may also be a peptide or disulfide bond. The antibody directs the 
toxin to cell types presenting a specific antigen thereby providing a 
'^M specificity of action not possible with the natmral toxin. Immunotoxins 

have been made both with the entire ricin molecule (i.e. both chains) and 
'5 15 with the ricin A chain alone (Spooner et al., Mol. Immtmol. 31:117-125, 

f (1994)). 

Immunotoxir\s made with the ricin dimer (IT-Rs) are more 
|=U potent toxins than those made with only the A chain (IT- As). The 

% increased toxicity of IT-Rs is thought to be attributed to the dual role of the 

O 20 B chains in binding to the cell surface and in translocating the A chain to the 

cytosolic compartment of the target cell (Vitetta et al.. Science 238:1098- 
1104 (1987); Vitetta & Thorpe, Seminars in Cell Biology 2:47-58 (1991)). 
However, the presence of the B chain in these conjugates also promotes 
the entry of the immimotoxin into nontarget cells. Even small amounts of 
25 B chciin may override the specificity of the cell-binding component as the B 
chain will bind nonspecifically to galactose associated with N-linked 
carbohydrates, which is present on most cells.IT-As are more specific and 
safer to use than IT-Rs. However, in the absence of the B chain the A chain 
has greatly reduced toxicity. Due to the reduced potency of IT-As as 
30 compared to IT-Rs, large doses of IT-As mxist be administered to patients. 
The large doses frequently cause immune responses and production of 
neutralizing antibodies in patients (Vitetta et al.. Science 238:1098-1104 
(1987)). IT-As and IT-Rs both suffer from reduced toxicity as the A chain is 
not released from the conjugate into the target cell cytoplasm. 
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A number of immunotoxins have been designed to 
recognize antigens on the surfaces of tumour cells and cells of the imm\me 
system (Pastan et al.. Annals New York Academy of Sciences 758:345-353 
(1995)). A major problem with the use of such immunotoxins is tiiat the 
5 antibody component is its only targeting mechanism and the target 
antigen is often foimd on non-target cells (Vitetta et al.. Immunology Today 
14:252-259 (1993)). AlsO/ the preparation of a suitable specific cell binding 
component may be problematic. For example, antigens specific for the 
target cell may not be available and many potential target cells and 
10 infective organisms can alter their antigenic make up rapidly to avoid 
ff= immune recognition. In view of the extreme toxicity of proteins such as 

ricin, the lack of specificity of the immimotoxins may severely limit their 
usefulness as therapeutics for the treatment of cancer and infectious 
diseases. 

;^ 15 The insertion of intramolecular protease cleavage sites 

s between the cytotoxic and cell-binding components of a toxin can mimic 

the way that the natural toxin is activated. European patent application no. 
M 466,222 describes the use of maize-derived pro-proteins which can be 

2| converted into active form by cleavage with extracellular blood enzymes 

Ci 20 such as factor, Xa, thrombin or coilagenase. Garred, 0. et al. (J. BioL Chem, 

270:10817-10821 (1995)) documented the use of a ubiquitous calcium- 
dependent serine protease, furin, to activate shiga toxin by cleavage of the 
trypSin-sensitive linkage between the cytotoxic A-chain and the pentsuner 
of cell-binding B-units. Westby et al. ('Bioconjugate Chem. 3:375-381 (1992)) 
25 documented fusion proteins which have a specific cell binding component 
and proricin with a protease sensitive cleavage site specific for factor Xa 
within the linker sequence. O'Hare et al. (FEBS Lett. 273:200-204 (1990)) 
also described a recombinant fusion protein of RTA and staphylococcal 
protein A joined by a trypsin-sensitive cleavage site. In view of the 
30 ubiquitous nature of the extracellular proteases utilized in these 
approaches, such artificial activation of the toxin precursor or 
immunotoxin does not confer a mechanism for intracellular toxin 
activation and the problems of target specificity and adverse 



immunological reactions to the cell-binding component of the 
immimotoxin remain. 

In a variation of the approach of insertion of intramolecular 
protease cleavage sites on proteins which combine a binding chain and a 
5 toxic chain, Leppla, S.H. et al. (Bacterial Protein Toxins zbl.bakt.suppl. 
24:431-442 (1994)) suggest the replacement of the native cleavage site of 
the protective antigen (PA) produced by Bacillus anihracis v^ith a cleavage 
site that is recognized by cells that contain a particular protease. PA, 
recognizes, binds, and thereby assists in the internalization of lethal factor 
10 (U) and edema toxin (ET), also produced by Bacillus anihracis. However, 
m this approach is wholly dependent on the availability of LF, or ET and PA 

aU being localized to cells wherein the modified PA can be activated by the 
■n specific protease. It does not confer a mechanism for intracellular toxin 

activation and presents a problem of ensuring sufficient quantities of toxin 

□ 15 for internalization in target cells. 

The in vitro activation of a Staphylococcus-deiived pore 
forming toxin, (x-hemolysin by extracellular tumour-associated proteases 
CI has been dociunented (Panchel, R.G. et al.. Nature Biotechnology 14:852-857 

41 (1996)). Artificial activation of a-hemolysin in vitro by said proteases was 

□ 20 reported but the actual activity and utility of a-hemolysin in the 

destruction of target cells were not demonstrated. 

Hemolysin does not inhibit protein sjoithesis but is a 
heptameric transmembrane pore which acts as a channel to 2illow leakage 
of molecules up to 3 kD thereby disrupting the ionic balances of the living 

25 cell. The a-hemolysin activation domain is likely located on the outside of 
the target cell (for activation by extracellular proteases). The triggering 
mechanism in the disclosed hemolysin precursor does not involve the 
intracellular proteolytic cleavage of 2 functionally distinct domains. Also, 
the proteases used for the a-hemolysin activation are ubitquitiously 

30 secreted extracellular proteases and toxin activation would not be confined 
to activation in the vicinity of diseased cells. Such widespread activation of 
the toxin does not confer target specificity and limits the usef vdness of said 
a-hemolysin toxin as therapeutics due to systemic toxicity. 
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A variety of proteases specifically associated with 
maUgnancy have been identified and described. For example, cathepsin is 
a family of serine, cysteine or aspartic endopeptidases and exopeptidases 
which has been implicated to play a primary role in cancer metastasis 
5 (Schwartz, M.K., Clin. Chim. Acta 237:67-78 (1995); Spiess, E. et al., J. 
Histochem. Cytochem. 42:917-929 (1-994); Scarborough, P.E. et al.. Protein 
Sci. 2:264276 (1993); Sloane, B.F. et al., Proc. Natl. Acad. Sci. USA 83:2483- 
2487 (1986); Mikkelsen, T. et al., J. Neurosurge 83:285-290 (1995)). Matrbc 
metalloproteinases (MMPs or matrixii\s) are zinc-dependent proteinases 
10 consisting of coUagenases, matrilysin, stromelysins, gelatinases and 
macrophage elastase (Krane, S.M., Ann, N.Y. Acad. Sci. 732:1-10 (1994); 
Woessner, J.F., Ann, N.Y. Acad. Sci. 732:11-21 (1994); Carvalho, K. et al., 
Biochem. Biophys. Res., Comm. 191:172-179 (1993); Nakano, A. et al. J. of 
Neurosurge, 83:298-307 (1995); Peng, K-W, et al. Human Gene Therapy, 8:729- 
15 738 (1997); More, D.H. et al. Gynaecologic oncology, 65:78-82 (1997)). These 
proteases are involved in pathological matrix remodeling. Under normal 
physiological conditicms, regulation of matrbdn activity is effected at the 
level of gene expression. Eruymatic activity is also controlled stringently 
by tissue inhibitors of metalloproteinases (UMPs) (Murphy, G. et al., Ann. 
20 N.Y. Acad. Sci. , 732:31-41 (1994)). The expression of MMP genes is reported 
to be activated in inflammatory disorders (e.g. rheumatoid arthritis) and 
malignancy. 

The present inventor has prepared novel recombinant toxic 
proteins which are specifically toxic to diseased cells but do not depend for 

25 their specificity of action on a specific cell binding component. The 
recombinant proteins toxins have an A chain of a ridn-like toxin linked to a 
B chain by a synthetic linker sequence which maybe cleaved specifically by 
a protease localised in cells or tissues affected by a specific disease to 
liberate the toxic A chain thereby selectively inhibiting or destroying the 

30 diseased cells or tissues (WO 98/49311 published November 5, 1998 which 
is incorporated herein by reference). 
giTMM ARY OF THK INVENTION 

The present invention relates to novel linker sequences that 
can be used to prepare recombinant toxic proteins having an A chain of a 
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ricin-like toxin linked to a B chain by the linker sequence. The novel linker 
sequences of the invention are illustrated in Figxires 1-20. 

In one aspect the present invention provides a purified and 
isolated nucleic add encoding a linker sequence comprising: the nucleic acid 
5 sequence of pAP301 as shown in Figure lA; the nucleic acid sequence of 
pAP302 as shown in Figure 2A; the nucleic acid sequence of pAP303 as 
shown in Figure 3A; the nucleic acid sequence of pAP304 as shown in 
Figure 4A; the nucleic acid sequence of pAP305 as shown in Figure 5A; the 
nucleic acid sequence of pAP308 as shown in Figure 6A; the nucleic acid 
10 sequence of pAP309 as shown in Figure 7 A; the nucleic acid sequence of 
3t pAP313 as shown in Figure 8A; the nucleic acid sequence of pAP314 as 

shown in Figure 9A; the nucleic acid sequence of pAP315 as shown in 
Figure lOA; the nucleic acid sequence of pAP316 as shown in Figure 11 A; 
the nucleic acid sequence of p APS 17 as shown in Figure 12A; the nucleic 
15 add sequence of pAP318 as shown in Figtue 13A; the nucleic add sequence 
of pAP319 as shown in Figixre 14A; the nucleic acid sequence of pAP320 as 
shown in Figure 15A; the nucleic acid sequence of pAP321 as shown in 
Figure 16A; the nucleic acid sequence of pAP322 as shown in Figtire 17A; 
the nucleic acid sequence of pAP323 as shown in Figure ISA; ttie nucleic 
20 acid sequence of pAP324 as shown in Figure 19A; and the nucleic acid 
sequence of pAP325 as shown in Figure 20A. 

In another aspect, the present invention provides a purified 
and isolated nucleic acid encoding a recombinant toxic protein comprising 
(a) a nucleotide sequence encoding an A chain of a ricin-like toxin, (b) a 
25 nucleotide sequence encoding a B chain of a ricin-like toxin and (c) a 
heterologous linker amino add sequence, linking the A and B chains. The 
liiJicer sequence is not a native linker sequence of a ricin-like toxin, but 
rather a synthetic heterologous linker sequence containing a cleavage 
recognition site for a disease-specific protease. The A and or the B chain 
30 may be those of ricin. 

The recombinant toxic proteins employing the novel linker 
sequences of the present invention may be used to treat various forms of 
cancer induding, but not limited to, T- and B-cell lymphoproliferative 
diseases, ovarian cancer, pancreatic cancer, head and neek cancer. 
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squamous cell carcinoma, gastrointestinal cancer, breast cancer, prostate, 
cancer and non small cell lung cancer. In an embodiment, of the invention 
the cleavage recognition site of the linker is the cleavage recognition site 
for a cancer-assodated protease. 

In particular embodiments, the amino acid sequence of the 
linker comprises the sequence of PAP301 shown in Figure IC; the 
sequence of PAP302 shown in Figure 2C; the sequence of PAP303 shown in 
Figure 3C; the sequence of PAP304 shown in Figure 4C; the sequence of 
PAP305 shown in Figiore 5C; the sequence of PAP308 shown in Figure 6C; 
the sequence of PAP309 shown in Figure 7C; the sequence of PAP316 
shown in Figure IIC; the sequence of PAP317 shown in Figure 12C; the 
sequence of PAP318 shown in Figure 13C; the sequence of PAP319 shown 
Figure 14C; the sequence of PAP323 shown in Figure 18C; the sequence of 
PAP324 shown in Figture 19C; and the sequence of PAP325 shown in 
Figure 20C; all cleaved by MMP-9; the sequence of PAP313 shown in 
Figure 8C; the sequence of PAP314 shown in Figure 9C; the sequence of 
PAP315 shown in Figure IOC; the sequence of PAP320 shown in Figure 
15C; the sequence of PAP321 shown in Figure 16C; the sequence of 
PAP322 shown in Figure 17C; all cleaved by tirokinase-type plasminogen 
activator. 

In a preferred embodiment, the nucleic acid sequences of 
the recombinant toxic proteins containing ricin A and B chains with each of 
ti\e linker sequences are shown in Figures IB, 2B, 3B, 4B, 5B, 6B, 7B, 8B, 9B, 
lOB, IIB, 12B, 13B, 14B, 15B, 16B, 17B 18B, 19B and 20B. 

The present invention also provides a plasmid incorporating 
the nucleic acid of the invention. In another embodiment, the present 
invention provides a baculovirus transfer vector incorporating the nucleic 
acid of the invention. 

In a further aspect, the present invention provides a 
recombinant protein comprising an A chain of a ricin-like toxin, a B chain 
of a ricin-like toxin and a heterologous linker amino acid sequence, linking 
the A and B chains, wherein the linker sequence contains a cleavage 
recognition site for a cancer-specific protease. The A and /or the B chain 
may be those of ricin. In an embodiment, the cleavage recognition site is 
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the cleavage recognition site for a cancer protease substantially as 
described above. In a particxilar embodiment, the cancer is T-cell or B-cell 
lymphoproliferative disease, ovarian cancer, pancreatic cancer, head and 
neck cancer, squamous cell carcinoma, gastrointestinal cancer, breast 
5 cancer, prostate cancer, non small cell Itmg cancer. 

In a further aspect, the invention provides a pharmaceutical 
composition for treating cancer comprising a recombinant protein of the 
invention and a pharmaceutically acceptable carrier, diluent or excipient. 

In yet another aspect, the invention provides a method of 
10 inhibiting or destroying cancer cells, which cancer cells are associated with 
fli a specific protease, comprising the steps of preparing a recombinant 

protein of the invention having a heterologous liiJser sequence which 
'Ji contains a cleavage recogrution site for the cancer specific protease and 

^ administering the recombinant protein to the cells. In an embodiment, the 

15 cancer is T-cell or B-cell lymphoproliferative disease, ovarian cancer, 
pancreatic cancer, head and neck cancer^ squamous cell carcinoma, 
gastrointestinal cancer, breast cancer, prostate cancer, non small cell lung 
cancer. 

The present invention also relates to a method of treating a 
20 cancer wherein the cells affected by the cancer are associated with a specific 
protease by administering an effective amount of one or more 
recombinant proteins of the invention to an animal in need thereof. 

Still further, a process is provided for preparing a 
pharmaceutical for treating cancer wherein cells affected by the camcer are 
25 associated with a specific protease comprising the steps of preparing a 
purified and isolated nucleic acid having a nucleotide sequence encoding an 
A chain of a ricin-like toxin, a B chain of a ricin-like toxin and a 
heterologous linker amino acid sequence, linking the A and B chains, 
wherein the linker sequence contains a cleavage recognition site for the 
30 specific protease; introducing the nucleic acid into a host cell; expressing the 
nucleic acid in the host cell to obtain a recombinant protein comprising an 
A chain of a ricin-like toxin, a 13 chain of a ricin-like toxin and a 
heterologous linker amino acid sequence, linking the A and B chains 
wherein the linker sequence coittains the cleavage recogrution site for the 
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specific protease; and sxispending the protein in a pharmaceutically 
acceptable carrier, diluent or excipient. 

Other features and advantages of the present invention will 
become apparent from the following detailed description. It should be 
5 understood, however, that the detailed description and the specific 
examples while indicating preferred embodiments of the invention are 
given by way of illustration only, since various changes and modi£icatioi\s 
within the spirit and scope of the invention will become apparent to those 
skilled in the art from this detailed description. 
10 DESCRIPTION OF THE DRAWINGS 
yl The invention will be better understood with reference to 

the drawings in which: 

^ Figure lA shows the nucleotide sequence of the MMP-9 

linker region of pAPSOl; 
y 15 Figure IB shows the nucleotide sequence of tiie pAP301 

insert containing ridn and the MMP-9 linker; 
'% Figure IC shows the amino acid sequence of the PAP301 

|«« linker and the wild type ricin linker; 

r22 Figure 2A shows the nucleotide sequence of the MMP-9 30 

O 20 linker region of pAP302; 

Figure 2B shows the nucleotide sequence of the pAP302 
insert containing ricin and the MMP-9 linker; 

Figure 2C shows the amino acid sequence of the PAP302 
linker and the vsdld type ricin linker; 
25 Figure 3A shows the nucleotide sequence of the MMP-9 

Unker region of pAP303; 

Figure 3B shows the nucleotide sequence of the pAP303 
insert contairving ricin amd the MMF-9 linker; 

Figure 3C shows the amino acid sequence of the PAP303 
30 linker and the wild type ricin linker; 

Figure 4A shows the nucleotide sequence of the MMP-9 
linker region of pAP304; 

Figure 4B shows the nucleotide sequence of the pAP304 
insert containing ricin and the MMP-9 linker; 
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Figure 4C shows the amino acid sequence of the PAP304 
linker and the wild type ricin linker; 

Figure 5A shows the nucleotide sequence of the MMP-9 
linker region of pAP305; 
5 Figure 5B shows the nucleotide sequence of the pAP305 

insert containing ricin and the MMP-9 linker; 

Figure 5C shows the amino acid sequence of the PAP305 
linker and the wild type ridn linker; 

Figure 6A shows the nucleotide sequence of the MMP-9 
10 linker region of pAP308; 

Figure 6B shows the nucleotide sequence of the pAP308 
insert containing ricin and the MMP-9 linker; 

Figure 6C shows the amino acid sequence of the pAP308 
linker and the wild type ridn linker; 
15 Figure 7 A shows the nucleotide sequence of the MMP-9 

linker region of pAP309; 

Figure 7B shows the nucleotide sequence of the pAP309 
insert containing ricin and the MMP-9 linker; 

Figure 7C shows the amino acid sequence of the PAP309 
20 linker and the wild type ricin lii\ker; 

Figure 8A shows the nucleotide sequence of the UPA linker 
region of pAP313; 

Figure 8B shows the nucleotide sequence of the pAP313 
insert containing ridn and tiie UPA linker; 
25 Figure 8C shows the amino add sequence of tfie PAP313 

linker and the wild type ridn linker; 

Figure 9A shows ttie nucleotide sequence of the UPA linker 
region of pAP314; 

Figure 9B shows the nucleotide sequence of the pAP314 
30 insert containing ricin and the UPA linker; 

Figure 9C shows the amino acid sequence of the PAP314 
linker and the wild type ricin liiJcer; 

Figure lOA shows the nucleotide sequence of the UPA 
lirJfcer region of pAP315; 
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Figure lOB shows the nucleotide sequence of the pAP315 
insert containing ricin and the UPA linker; 

Figure IOC shows the amino acid sequence of the PAP315 
linker and the wild type ricin linker; 
5 Figure 11 A shows the nucleotide sequence of the MMP-9 

linker region of pAP316; 

Figure IIB shows the nucleotide sequence of the pAP316 
insert containing ricin and the MMP-9 linker; 

Figure IIC shows the amino acid sequence of the PAP316 
10 linker and the wild type ricin linker; 

Figure 12A shows the nucleotide sequence of the MMP-9 
linker region of pAP317; 

Figure 12B shows nucleotide sequence of the pAP317 insert 
containing ricin and the MMP-9 linker; 
15 Figure 12C shows the amino add sequence of the PAP317 

linker and tt\e wild type ricin linker; 

Figure 13A shows the nucleotide sequence of the MMP-9 
linker region of pAP318; 

Figure 13B shows the nucleotide sequence of the pAP318 
20 insert containing ridn and the MMP-9 linker; 

Figure 13C shows the amino acid sequence of the PAP318 
linker and the wild type ricin linker; 

Figure 14 A shows the nucleotide sequence of the MMP-9 
linker region of p AP319; 
25 Figure 14B shows the nucleotide sequence of the pAP319 

insert containing ridn and the MMP-9 linker; 

Figure 14C shows the amino acid sequence of the PAP319 
linker and the wild type ricin linker; 

Figure 15A shows the nucleotide sequence of the UPA 
30 linker region of pAP320; 

Figure 15B shows the nucleotide sequence of the pAP320 
insert containing ricin and the UPA linker; 

Figure 15C shows the amino add sequence of the PAP320 
linker and the wild type ricin linker; 
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Figure 16A shows the nucleotide sequence of the UPA 
linker region of pAP321; 

Figure 16B shows the nucleotide sequence of the pAP321 
insert containing ricin and the UPA linker; 
5 Figure 16C shows the amino acid sequence of the PAP321 

linker and the wild type ricin linker; 

Figure 17A shows the nucleotide sequence of the UPA 
linker region of p AP322; 

Figure 17B shows the nucleotide sequence of the pAP322 
10 insert containing ridn and the UPA linker; 

Figure 17C shows the amino add sequence of the PAP322 
linker and the wild type ricin linker; 

Figure 18A shows the nucleotide sequence of the MMP-9 
liiiker region of pAP323; 
15 Figure 18B shows the nucleotide sequence of the pAP323 

insert containing ricin and the MMP-9 linker; 

Figure 18C shows the amino acid sequence of the PAP323 
liiJcer and the wild type ridn linker; 

Figure 19A shows the nucleotide sequence of the MMP-9 
20 linker region of pAP324; 

Figure 19B shows the nucleotide sequence of the pAP324 
insert containing ridn and the MMP-9 linker; 

Figure 19C shows the amino acid sequence of the PAP324 
linker and the wild type ridn linker; 
25 Figure 20A shows the nucleotide sequence of the MMP-9 

linker region of pAP325; 

Figure 20B shows the nucleotide sequence of the pAP325 
insert containing ricin and the MMP-9 linker; 

Figure 20C shows the amino acid sequence of the PAP325 
30 linker and the wild type ricin linker; 

Figure 21 is a graph showing the treatment of human 
tumour A431 with PAP304; and 

Figure 22 is a graph showing the/treatment of human 
tumour A431 with PAP305. 
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DETAILED DESCRIPTION OF THE INVENTION 
1, Nucleic Acid Molecules of the Invention 

As mentioned above, the present invention relates to 
isolated and purified nucleic acid molecules encoding linker sequences. The 
5 present invention also relates to isolated and purified nucleic acid 
molecules encoding a recombinant toxic protein comprising (a) a 
nucleotide sequence encoding an A chain of a ricin-like toxin, (b) a 
nucleotide sequence encoding a B chain of a ricin- like toxin and (c) a 
nucleotide sequence encoding a linker amino acid sequence of the 
10 invention, linking the A and B chains. The heterologous linker sequence 
rn contains a cleavage recognition site for a cancer-specific protease. 

P The term "isolated and purified" as used herein refers to a 

nucleic acid substantially free of cellular material or culture medium when 
J: produced by recombinant DNA techniques, or chemical precursors, or 

□ 15 other chemicals when chemically synthesized. An "isolated and purified" 
T nucleic acid is also substantially free of sequences which naturally flank the 
=«J nucleic acid (Le. sequences located at the 5' and 3' ends of the nucleic acid) 

from which the rmcleic acid is derived. The term "nucleic acid" is intended 
f to include DNA and RNA and can be either double stranded or single 

□ 20 stranded. 

The term "linker sequence" as used herein refers to an 
internal amino acid sequence within the protein encoded by a nucleic acid 
molecule of the invention which contains residues linking the A and B 
chain of a ricin- like toxin so as to render the A chain incapable of exerting 

25 its toxic effect, for example catalytically inhibiting translation of an 
eukaryotic ribosome. The linker sequences of the invention are 
heterologous to the A and B chain of a ricin-like toxin. By heterologous is 
me8mt that the linker sequence is not a sequence native to the A or B chain 
of a ricin-like toxin or precursor thereof. However, preferably, the linker 

30 sequence may be of a similar length to the linker sequence of a ricin-like 
toxin and should not interfere with the role of the B chain in cell binding 
and transport into the cytoplasm. When the linker sequence is cleaved the 
A chain becomes active or toxic. 
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The nucleic acid molecule of the invention encoding a 
recombinant toxic protein is cloned by subjecting a preproricin cDNA clone 
to site-directed mutagenesis in order to generate a series of variants 
differing only in the sequence between the A and B chains (linker region). 
Oligonucleotides, corresponding to the extreme 5* and 3* ends of the 
preproricin gene are synthesized and used to PCR amplify the gene. Using 
the cDNA sequence for preproricin (Lamb et al., Eur. /. Biochem. 145:266- 
270 (1985)), several oligonucleotide primers are designed to flank the start 
and stop codons of the preproricin open reading frame. 

The preproricin cDNA is amplified using the upstream 
primer Ricin-99 or Ricin- 109 and the downstream primer Ricinl729C with 
Vent DNA pol5rmerase (New England Biolabs) using standard procedures 
(Sambrook et aL, Molecular Cloning: A Laboratory Manual, Second 
Edition, (Cold Spring Harbor Laboratory Press, 1989)). The purified PCR 
fragment encoding the preproricin cDNA is, then ligated into an Eco RV- 
digested pBluescript 11 SK plasmid (Stratagene), and is used to transform 
competent XLl-Blue cells (Stratagene). The cloned PCR product containing 
the putative preproricin gene is confirmed by DNA sequencing of the 
entire cDN A clone: 

The preproricin cDNA clone is subjected to site directed 
mutagenesis; in order to generate a series of variants differing only in the 
sequence between the A and B chains (linker region). The wild-type 
preproricin linker region is replaced with the heterogenous linker 
sequences that are cleaved by the various cancer-specific proteases. 

The linker regions of the variants encode a cleavage 
recognition sequence for a cancer-specific prote<ise. The mutagenesis and 
cloning strategies used to generate the cancer-specific protease-sensitive 
linker variants are summarized in WO 98149311 to the present inventor. 
Briefly, the first step involves a DNA amplification using a set of mutagenic 
primers in combination with the two flanking primers Ricin-109Eco and 
Ridnl729C Pst L Restriction digested PCR fragments are gel purified and 
then ligated with PVL1393 which has been digested with Eco RI and Pst 1. 
Ligation reactions are used to transform competent XLI-Blue cells 
(Stratagene). Recombinant clones are identified by restriction digests of 
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plasmid miniprep, DNA and the mutant linker sequences are confirmed by 
DNA sequencing. 

The nucleotide sequences of the novel linker sequences of 
the invention are as follows: the nucleic acid sequence of pAP301 is shown 
5 in Figure 1 A; the nucleic acid sequence of pAP302 is shown in Figure 2A; 
ti\e nucleic acid sequence of pAP303 is shown in Figure 3A; the nucleic acid 
sequence of pAP304 is shown in Figure 4A; the nucleic acid sequence of 
pAF305 is shown in Figure 5A; the nucleic acid sequence of pAP308 is 
shown in Figure 6A; the nucleic acid sequence of pAP309 is shown in 
10 Figure 7A; the nucleic acid sequence of pAP313 is shown in Figure 8A; the 
rji nucleic acid sequence of pAP314 is shown in Figure 9A; the nucleic acid 

J^i sequence of pAP315 is shown in Figure lOA; the nucleic acid sequence of 

^1 pAP316 is shown in Figvire llA; the nucleic acid sequence of pAP317 is 

J shown in Figure 12A; the nucleic acid sequence of pAP318 is shown in 

15 Figure 13A; the nucleic acid sequence of pAP319 is shown in Figure 14A; 
the nucleic acid sequence of pAP320 is shown in Figure 15A; the nucleic 
add sequence of pAP321 is shown in Figure 16A; the nucleic add sequence 
of pAP322 is shown in Figure 17A; the nudeic acid sequence of p AP323 is 
shown in Figure 18A; the nucleic acid sequence of pAP324 is shown in 
20 Figure 19 A; and the nucleic acid sequence of pAP325 is shown in Figure 
20A. 

The nucleic acid molecule encoding a recombinant protein 
of the invention has sequences encoding an A chain of a ricin-like toxin, a B 
chain of a ricin-like toxin and a heterologous linker sequence containing a 
25 cleavage recognition site for a cancer-specific protease as described above. 
The nucleotide sequences encoding the recombinant proteins of the 
invention are shown in Figures 1B-20B. The nucleic add may be expressed 
to provide a recombinant protein having an A chain of a ridn-like toxin, a 
B chain of a ricin-like toxin and a heterologous linker sequence containing a 
30 cleavage recognition site for a disease-specific protease. 

The nucleic acid molecxile may comprise the A and /or B 
chain of ricin. The ricin gene has been cloned and sequenced, and the X- 
ray crystal structures of the A and B chains are published (Rutenber, E., et 
al. Proteins 10:240-250 (1991); Weston et al., Mol Biol 244:410-422 (1994); 
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Lamb and Lord, Eur, J. Biochem. 14:265 (1985); Hailing, K., et aL, Nucleic 
Acids Res. 13:8019 (1985)). It wUl be appreciated that the invention includes 
nucleic acid molecules encoding truncations of A and B chains of ricin like 
proteins and analogs and homologs of A and B chains of ricin-like proteins 
5 and truncations thereof (i.e., ricin-like proteins), as described herein. It will 
further be appreciated that variant forms of the nucleic add molecules of 
the invention which arise by alternative splicing of an mRNA 
corresponding to a cDNA of the invention are encompassed by the 
invention. 

10 Another aspect of the invention provides a nucleotide 

sequence which hybridizes under high stringency conditions to a 
nucleotide sequence encoding the A and /or B chains of a ricin-like protein. 
Appropriate stringency conditions which promote DNA hybridization are 
known to those skilled in the art, or can be found in Current Protocols in 
15 Molecular Biology, John Wiley & Sons, N.Y. (1989), 6.3.1 6.3.6. For 
example, 6.0 x sodium chloride /sodium citrate (SSC) at about 45**Cy 
followed by a wash of 2.0 x SSC at 50^C may be employed. 

The stringency may be selected based on the conditions 
used in the wash step. By way of example, the salt concentration in the 
20 wash step can be selected from a high stringency of about 0.2 x SSC at 
50°C. In addition, the temperature in the wash step can be at high 
stringency conditions, at about 65^C. 

The nucleic acid molecule may comprise the A and/or B 
chain of a ridn-like toxin. Methods for cloning ricin-like toxins are known 
25 in the art and are described, for example, in E.P. 466,222. Sequences 
encoding ricin or ricin-like A and B chains may be obtained by selective 
amplification of a coding region, using sets of degenerative primers or 
probes for selectively amplifying the coding region in a genomic or cDNA 
library. Appropriate primers may be selected from the nucleic add 
30 sequence of A and B chains of ricin or ricin-like toxins. It is also possible to 
design synthetic oligonucleotide primers from the nucleotide sequences for 
use in PCR. Suitable primers may be selected from the sequences 
encoding regions of ricin-like proteins which are highly conserved, as 
described for example in U.S. Patent No 5,101,025 and E.P. 466,222. 
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A nucleic acid can be amplified from cDNA or genomic 
DNA using these oligonucleotide primers and standard PCR 
amplification techniques. The nucleic acid so amplified can be cloned into 
an appropriate vector and characterized by DNA sequence analysis. It wiU 

5 be appreciated that cDNA may be prepared from mRNA, by isolating total 
cellular mRNA by a variety of techniques, for example, by using the 
guanidinium-thiocyanate extraction procedure of Chirgwin et al.. 
Biochemistry 18, 5294-5299 (1979). cDNA is then synthesized from the 
mRNA using reverse transcriptase (for example, Moloney MLV reverse 

10 transcriptase available from Gibco/BRL, Bethesda, MD, or AMV reverse 
transcriptase available from Seikagaku America, Inc., St. Petersburg, FL). 
It will be appreciated that the methods described above may be used to 
obtain the coding sequence from plants, bacteria or fungi, preferably 
plants, which produce known ricin-like proteins and also to screen for the 

15 presence of genes encoding as yet unknown ricin-like proteins. 

A sequence containing a cleavage recognition site for a 
specific protease may be selected based on the cancer which is to be 
targeted by the recombinant protein. The cleavage recognition site may 
be selected from sequences known to encode a cleavage recognition site 

20 for the cancer to be treated. Sequences encoding cleavage recogrution sites 
may be identified by testing the expression product of the sequence for 
susceptibility to cleavage by the respective protease. A polypeptide 
containing the suspected cleavage recognition site may be incubated with a 
protease and the amount of cleavage product determined (Dilannit, 1990, J. 

25 BioL Chem. 285: 17345-17354 (1990)). The protease may be prepared by 
methods known in the art and used to test suspected cleavage recognition 
sites. 

The nucleic add molecule of the invention may be prepared 
by site directed mutagenesis. For example, the cleavage site of a cancer- 
30 specific protease may be prepared by site directed mutagenesis of the 
homologous linker sequence of a proricin-like toxin. Procedures for 
cloning proricin- like genes, encoding a linker sequence are described in EP 
466,222. Site directed mutagenesis may be accomplished by DNA 
cimplification of mutagenic primers in combination with flanking primers. 
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The nucleic acid molecule of the invention may also encode 
a fusion protein. A sequence encoding a heterologous linker sequence 
containing a cleavage recognition site for a cancer-specific protease may be 
cloned from a cDNA or genomic library or chemically synthesized based 
on the known sequence of such cleavage sites. The heterologous lir\ker 
sequence may then be fused in frame with the sequences encoding the A 
and B chedns of the ricin-like toxin for expression as a fusion protein. It will 
be appreciated that a nucleic acid molecule encoding a fusion protein may 
contain a sequence encoding an A chain and a B chain from the same ricin- 
like toxin or the encoded A and B chains may be from different toxins. For 
example, the A chain may be derived from ricin and the B chain may be 
derived from abrin. A protein may also be prepared by chemical 
conjugation of the A and B chains and linker sequence using conventional 
coupling agents for covalent attachment. 

An isolated and purified nucleic acid molecule of the 
invention which is RNA can be isolated by cloning a cDNA encoding an A 
and B chain and a linker into an appropriate vector which allows for 
transcription of the cDNA to produce an RNA molecule which encodes a 
protein of the invention. For example, a cDNA can be cloned downstream 
of a bacteriophage promoter, (e.g. a T7 promoter) in a vector, cDNA can 
be transcribed in vitro with T7 polymerase, and the restiltant RNA can be 
isolated by standard techniques. 

II. Novel Linkers and Recombinant Proteins of the Invention 

As previously mentioned, the invention provides novel 
linker sequences. Preferably, the amino acid sequence of the linker is 
selected from: the amino acid sequence of P AP301 as shown in Figure IC; 
the amino acid sequence of PAP302 as shown in Figure 2C; the amino add 
sequence of PAP303 as shown in Figure 3C; the amino acid sequence of 
PAP304 as shown in Figure 4C; the amino acid sequence of PAP305 as 
shown in Figure 5C; the amino acid sequence of PAP308 as shown in 
Figure 6C; the amino acid sequence of PAP309 as shown in Figure 7C; the 
amino acid sequence of PAP313 as shown in Figure 8C; the amino acid 
sequence of PAP314 as shown in Figure 9C; the amino acid sequence of 
PAP315 as shown in Figure IOC; the amino acid sequence of PAP316 as 
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shown in Figure IIC; the amino acid sequence of PAP317 as shown in 
Figure 12C; the amino acid sequence of PAP318 as shown in Figure 13C; 
the amino acid sequence of PAP319 as shown in Figure 14C; the amino acid 
sequence of PAP320 as shown in Figure 15C; the amino acid sequence of 
5 PAP321 as shown in Figtire 16C; the amino acid sequence of PAP322 as 
shown in Figure 17C; the amino acid sequence of PAP323 as shown in 
Figure 18C; the amino acid sequence of PAP324 as shown in Figure 19C; 
and the amino add sequence of PAP325 as shown in Figtire 20C. 

The present invention also provides recombinant proteins 
10 which incorporate the A and B chains of a ricin like toxin linked by a 
heterologous linker sequence containing a cleavage recognition site for a 
cancer-specific protease as described above- It is an advantage of the 
recombinant proteins of the invention that they are non-toxic imtil the A 
chain is liberated from the B chain by specific cleavage of the linker by the 
15 target protease. 

ai The recombinant protein may be used to specifically target 

cancer cells in the absence of additional specific cell-binding components to 
target cancer cells. It is a further advantage that the cancer-specific 
protease cleaves the heterologous linker intracellularly thereby releasing 
20 the toxic A chain directly into the cytoplasm of the cancer cell. As a result, 
said cells are specifically targeted and normal cells are not directly exposed 
to the activated free A chain. 

Ricin is a plant derived ribosome inhibiting protein which 
blocks protein synthesis in eukaryotic cells. Ricin may be derived from the 
25 seeds of Ricinus communis (castor oil plant). The ricin toxin is a 
glycosylated heterodimer with A and B chain molecular masses of 30,625 
Da and 31,431 Da respectively. The A chain of ricin has an N-glycosidase 
activity and catalyzes the excision of a specific adenine residue from the 28S 
rRNA of eukaryotic ribosomes (Endo, Y; & Tsurugi, K. J. Biol. Chem. 
30 262:8128 (1987)). The B chain of ricin, although not toxic in itself, promotes 
the toxicity of the A chain by binding to galactose residues on the surface 
of eukaryotic cells and stimtdating receptor-mediated endocytosis of the 
toxin molecule (Simmons et al., Biol. Chem. 261:7912 (1986)). 
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All protein toxins are initially produced in an inactive, 
precursor form. Ricin is initially produced as a single pol5rpeptide 
(preproricin) with a 35 amino acid N-terminal presequence and 12 amino 
acid linker between the A and B chains. The pre-sequence is removed 
during translocation of the ridn precursor into the endoplasmic reticulum 
(Lord, J.M., Eur. J. Biochem. 146:403-409 (1985) and Lord, J.M., Eur. J. 
Biochem, 146:411-416 (1985)). The proricin is then translocated into 
specialized organelles called protein bodies where a plant protease cleaves 
the protein at a linker region between the A and B chains (Lord, J.M. et al., 
FASAB journal 8:201-208 (1994)). The two chains, however, remain 
covalently attached by an interchain disulfide bond (cysteine 259 in the A 
chain to cysteine 4 in the B chain) and mature disulfide linked ricin is stored 
in protein bodies inside plant cells. The A chain is inactive in the proricin 
(O'Hare, M., et aL, FEBS Lett. 273:200-204 (1990)) and it is inactive in the 
disulfide-linked mature ricin (Richardson, PT. et al., FEBS Lett. 255:15-20 
(1989)). The ribosomes of the castor bean plant are themselves susceptible 
to inactivation by ricin A chain; however, as there is no cell surface 
galactose to permit B chain recognition the A chain cannot re-enter the cell. 

Ricin-like proteins include, but are not limited to, bacterial, 
fungal and plant toxins which have A and B chains and inactivate 
ribosomes and irJiibit protein synthesis. The A chain is an active 
polypeptide subunit which is responsible for the pharmacologic effect of 
the toxin. In most cases the active component of the A chain is an enzyme. 
The B chain is responsible for binding the toxin to the cell sxirface ^xyd is 
thought to facilitate entry of tiKe A chain into the cell cytoplasm. The A and 
B chains in the mature toxins are linked by disulfide bonds. The toxins 
most similar in structure to ricLn are plant toxins which have one A chain 
and one B chain. Examples of such toxins include abrin which may be 
isolated from the seeds of Abrus precatorius, modeccin, volkensin and 
viscumin. 

Ricin-like bacterial proteins include diphtheria toxin, which 
is produced by Corynebacterium diphtheriae, Pseudomanas enterotoxin A 
and cholera toxin. It will be appreciated that the term ricin-like toxins is 
also intended to include the A chain of those toxix\s which have only an A 
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chain. The recombinant proteins of tihe invention covild include the A chain 
of these toxins conjugated to, or expressed as, a recombinant protein with 
the B chain of another toxin. Examples of plant toxins having only an A 
chain include trichosanthin, MMC and pokev^eed antiviral proteins, 
5 dianthin 30, dianthin 32, crotin n, curcin 11 and wheat germ inhibitor. 
Examples of fungal toxins having only an A chain include alpha-sarcin, 
restrictocin, mitogillin, enomycin, phenomycin. Examples of bacterial 
toxins having only an A chain include cytotoxin from Shigella dysenteriae 
and related Shiga-like toxins. Recombinant trichosanthin and the coding 

10 sequence thereof is disclosed in U.S. Patents 5,101,025 and 5,128,460. 

In addition to the entire A or B chains of a ricin-like toxin, it 
will be appreciated that the recombinant protein of the invention may 
contain only that portion of the A chain which is necessary for exerting its 
cytotoxic effect. For example, the first 30 amino acids of the ricin A chain 

15 may be removed resulting in a trimcated A chain which retains toxic 
activity. The truncated ricin or ricin-like A chain may be prepared by 
expression of a truncated gene or by proteolytic degradation, for example 
with Nagarase (Funmatsu et al., Jap. J. Med. Set BioL 23:264-267 (1970)). 
Similarly, the recombinant protein of the invention may contain only that 

20 portion of the B chain necessary for galactose recognition, cell binding and 
transport into the cell cytoplasm. Trimcated B chains are described for 
example in E.F. 145,111. The A and B chains may be glycosylated or non- 
glycosylated. Glycosylated A and B chains may be obtained by expression 
in the appropriate host cell capable of glycosylation. Non-glycosylated 

25 chains may be obtained by expression in nonglycosylating host cells or by 
treatment to remove or destroy the carbohydrate moieties. 

The proteins of the invention may be prepared using 
recombinant DNA methods.. Accordingly, the nucleic acid molecules of the 
present invention may be incorporated in a known manner into an 

30 appropriate expression vector which ensures good expression of the 
protein. Fossible expression vectors include but are not limited to cosmids, 
plasmids, or modified viruses (e.g. replication defective retroviruses, 
adenoviruses and adeno-associated viruses), so long as the vector is 
compatible with the host cell used. The expression vectors are "suitable for 
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transformation of a host cell", which means that the expression vectors 
contain a nucleic acid moleaile of the invention and regulatory sequences 
selected on the basis of the host cells to be used for expression, which is 
operatively linked to the nucleic acid molecule. Operatively linked is 
5 intended to mean that the nucleic acid is linked to regulatory sequences in 
a manner which allows expression of the nucleic add. 

The invention therefore contemplates a recombinant 
expression vector of the invention containing a nucleic acid molecule of the 
invention, or a fragment thereof, and the necessary regulatory sequences 
10 for the transcription and translation of the inserted protein-sequence. 

Suitable regulatory sequences may be derived from a 30 
ji? variety of sources, including bacterial, fungal, viral, mammalian, or insect 

K* genes (For example, see the regulatory sequences described in Goeddel, 

Gene Expression Technology: Methods in Enzymology 185, Academic 
15 Press, San Diego, CA (1990). Selection of appropriate regulatory sequences 
jig is dependent on the host cell chosen as discussed below, and may be 

readily accomplished by one of ordinary skill in the art. Examples of such 
regulatory sequences include: a transcriptional promoter and enhancer or 
RNA polymerase . binding sequence, a ribosomal binding sequence, 
Q 20 including a translation initiation signal. Additionally, depending on the 

host cell chosen and the vector employed, other sequences, such as an 
origin of replication, additional DNA restriction sites, enhancers, and 
sequences conferring inducibility of transcription may be incorporated into 
the expression vector. It will also be appreciated that the necessary 
25 regulatory sequences may be supplied by the native A and B chains 
and/or its flanking regions. 

The recombinant expression vectors of the invention may 
also contain a selectable marker gene which facilitates the selection of host 
cells transformed or transfected with a recombinant molecule of the 
30 invention. Examples of selectable marker genes are genes encoding a 
protein such as G418 and hygromycin which confer resistance to certain 
drugs, P-galactosidase, chloramphenicol acetyltransferase, firefly luciferase, 
or an immunoglobulin or portion thereof such as the Fc portion of an 
immunoglobulin preferably IgG. Transcription of the selectable marker 
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gene is monitored by changes in the concentration of the selectable marker 
protein such as p- galactosidase, chloramphenicol acetyltransferase, or 
firefly luciferase. If the selectable marker gene encodes a protein 
conferring antibiotic resistance such as neomycin resistance transformant 
5 cells can be selected with G418. Cells that have incorporated the selectable 
marker gene will survive, while the other cells die. This makes it possible 
to visualize and assay for expression of recombinant expression vectors of 
the invention and in particular to determine the effect of a mutation on 
expression and phenotype. It will be appreciated that selectable markers 
10 can be introduced on a separate vector from the nucleic acid of interest. 

The recombinant expression vectors may also contain genes 
%^ which encode a fusion moiety which provides increased expression of the 

hi recombinant protein; increased solubility of the recombinant protein; and 

aid in the purification of the target recombinant protein by acting as a 
15 ligand in affinity purification. For example, a proteolytic cleavage site may 
be added to the target recombinant protein to allow separation of the 
recombinant protein from the fusion moiety subsequent to purification of 
the fusion protein. Typical fusion expression vectors include pGEX (Amrad 
Corp., Melbourne, Australia), pMA.L (New England Biolabs, Beverly, MA) 
20 and pRIT5 (Pharmacia, Piscataway, NJ) which fuse glutathione S- 
transferase (GST), maltose E binding protein, or protein A, respectively, to 
the recombinant protein. 

Recombinant expression vectors can be introduced into host 
cells to produce a transformed host cell. The term "transformed host cell" 
25 is intended to include prokaryotic and eukaryotic cells which have been 
transformed or transfected with a recombinant expression vector of the 
invention. The terms "transformed with", "transfected with", 
"transformation" and "transfection" are intended to encompass 
introduction of nucleic acid (e.g. a vector) into a cell by one of many 
30 possible techniques known in the art. Prokaryotic cells can be transformed 
with nucleic acid by, for example, electroporation or calcitun-chloride 
mediated transformation. Nucleic acid can be introduced into mammalian 
cells via conventional techniques such as calcium phosphate or calcium 
chloride co-precipitation, DEAE-dextran mediated transfection, lipofectin. 
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electroporation or microinjection. Suitable methods for transforming and 
transfecting host cells can be found in Sambrook et al. (Molecular Cloning: 
A Laboratory Manual, 2nd Edition, Cold Spring Harbor Laboratory press 
(1989)), and other laboratory textbooks. 
5 Suitable host cells include a wide variety of prokaryotic and 

eukaryotic host cells. For example, the proteins of the invention may be 
expressed in bacterial cells such as E. coli, insect cells (using baculovirus), 
yeast cells or mammalian cells. Other suitable host cells can be foimd in 
Goeddel, Gene Expression Technology: Methods in Enzymology 185, 

10 Academic Press, San Diego, CA (1991). 

More particularly, bacterial host cells suitable for carrying 
out the present invention include E. coli, B. subtilis. Salmonella typhimurium, 
and various species within the genus' Pseudomonas, Streptomyces, and 
Staphylococcus, as well as many other bacterial species well known to one 

15 of ordinary skill in the art. Suitable bacterial expression vectors preferably 
comprise a promoter which functions in the host cell, one or more 
selectable phenotypic markers, and a bacterial origin of replication. 
Representative promoters include the p-lactamase (penicillinase) and 
lactose promoter system (see Chang et al.. Nature 275:615 (1978)), the trp 

20 promoter (Nichok and Yanofsky, Meth in Enzymology 101:155, (1983) and 
the tac promoter (Russell et aL, Gene 20:231, (1982)), Representative 
selectable markers include various antibiotic resistance markers such as the 
kanamycin or ampicillin resistance genes. Suitable expression vectors 
include but are not limited to bacteriophages such as lambda derivatives or 

25 plasmids such as pBR322 (Bolivar et al.. Gene 2:9S, (1977)), the pUC 
plasmids pUC18, pUC19, pUCllS, pUC119 (see Messing, Meth in 
Enzymology 101:20-77, 1983 and Vieira and Messing, Gene 19:259-268 

(1982)), and pNH8A, pNH16a, pNH18a, and Bluescript M13 (Stratagene, La 
JoUa, Calif.). 

30 Typical fusion expression vectors which may be used are 

discussed above, e.g. pGEX (Amrad Corp., Melbourne, Australia), pMAL 
(New England Biolabs, Beverly, MA) and pRrr5 (Pharmacia, Piscataway, 
NJ). Examples of inducible non-fusion expression vectors include pTrc 
(Amann et al.. Gene 69:301-315 (1988)) and pET lid (Studier et aL, Gene 
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Expression Technology: Methods in Enzymology 185, Academic Press, San 
Diego, California, 60-89 (1990)). 

Yeast and fungi host cells suitable for carrying out the 
present invention include, but are not limited to Saccharomyces cerevisae, the 

5 genera Pichia or Kluyveromyces and various species of the genus Aspergillus, 
Examples of vectors for expression in yeast S. cerivisae include pYepSecl 
(Baldari. et al., Embo J. 6:229-234 (1987)), pMFa (Kurjan and Herskowitz, 
Cell 30:933-943 (1982)), pJRY88 (Schultz et al.. Gene 54:113-123 (1987)), and 
pYES2 (Invitrogen Corporation, San Diego, CA). Protocols for the 

10 transformation of yeast and fungi are well known to those of ordinary skill 
in the art (see Hinnen et al., Proc. Natl.Acad. ScL USA 75:1929 (1978); Itoh et 
al., /. Bacteriology 153:163 (1983), and Cullen et al. (BiolTechnology 5:369 
(1987))- 

Mammalian cells suitable for carrying out the present 

15 invention include, among others: COS (e.g., ATCC No. CRL 1650 or 1651), 
BHK (e.g. ATCC No. CRL 6281), CHO (ATCC No. CCL 61), HeLa (e.g., 
ATCC No. CCL 2), 293 (ATCC No. 1573) and NS-1 cells. Suitable 
expression vectors for directing expression in mammalian cells generally 
include a promoter (e.g., derived from viral material such as polyoma, 

20 Adenovirus 2, cytomegalovirus and Simian Virus 40), as well as other 
transcriptional and translational control sequences. Examples of 
mammalian expression vectors include pCDM8 (Seed, B., Nature 329:840 
(1987)) and pMT2PC (Kaufman et al., EMBO /. 6:187-195 (1987)). 

Given the teachings provided herein, promoters, 

25 terminators, and methods for introducing expression vectors of an 
appropriate type into plant, avian, and insect cells may also be readily 
accomplished. For example, within one embodiment, the proteins of the 
invention may be expressed from plcmt cells (see Sinkar et al., /. Biosci 
(Bangalore) 11:47-58 (1987), which reviews the use of Agrobacterium 

30 rhizogenes vectors; see also Zambryski et aL, Genetic Engineering, 
Principles and Methods, Hollaender and Setlow (eds.), Vol. VI, pp. 253-278, 
Plenxmi Press, New York (1984), which describes the use of expression 
vectors for plant cells, including, among others, PAPS2022, PAPS2023, and 
PAPS2034) 
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Insect cells smtable for carrying out the present invention 
include cells and cell lines from Bombyx, Trichoplusia or Spodotera species. 
Baculovirus vectors available for expression of proteins in cultured insect 
cells (SF 9 cells) include the pAc series (Smith et al., Mol Cell Biol 3:2156- 
2165 (1983)) and the pVL series (Lucklow, V.A., and Summers, M.D., 
Virology 170:31-39 (1989)). Some baculovirus-insect cell expression 
systems suitable for expression of the recombinant proteins of the 
invention are described in PCT/US/02442. 

Alternatively, the proteins of the invention may also be 
expressed in non-human transgenic animals such as, rats, rabbits, sheep 
and pigs (Hammer et al. Nature 315:680-683 (1985); Palmiter et al. Science 
222:809-814 (1983); Brinster et al. Proc. Natl Acad. Scl USA 82:4438-4442 
(1985); Palmiter and Brinster Cell 41:343-345 (1985) and U.S. Patent No, 
4,736,866). 

The proteins of the invention may also be prepared by 
chemical synthesis using techniques well knov/n in the chemistry of 
proteins such as solid phase synthesis (Merrifield, /. Am. Chem. Assoc. 
85:2149-2154 (1964)) or synthesis in homogenous solution (Houbenweyl, 
Methods of Organic Chemistey; ed. E. Wansch, Vol. 15 I and II, Thieme, 
Stuttgart (1987)). 

The present invention also provides proteins comprising an 
A chain of a ricin-like toxin, a B chain of a ricin-like toxin and a 
heterologous linker amino acid sequence liiOcing the A and B chains, 
wherein the linker sequence contains a cleavage recognition site for a 
cancer-specific protease. Such a protein could be prepared other than by 
recombinant means, for example by chemical synthesis or by conjugation 
of A and B chains and a linker sequence isolated and purified from their 
natural plant, fungal or bacterial source. Such A and B chains could be 
prepared having the glycosylation pattern of the native ricin-like toxin. 

N-terminal or C-terminal fusion proteins comprising the 
protein of the invention conjugated with other molecules, such as proteins 
may be prepared by fusing, through recombinant techniques. The 
resultant fusion proteins contain a protein of the invention fused to the 
selected protein or marker protein as described herein. The recombinant 
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protein of the invention may also be conjugated to other proteins by 
known techniques. For example, the proteins may be coupled using 
heterobifunctional thiol-containing linkers as described in WO 90/10457, 
N-succinimidyl-3-(2-pyridyldithio-proprionate) or N-succinimidyl-5 
5 thioacetate. Examples of proteins which may be used to prepare fusion 
proteins or conjugates include cell binding proteins such as 
immunoglobulins, hormones, growth factors, lectins, insulin, low density 
lipoprotein, glucagon, endorphins, transferrin, bombesin, 
asialoglycoprotein glutathione-S-transferase (GST), hemagglutinin (HA), 

10 and trimcated myc. 

III. Utility of the Nucleic Acid Molecules and Proteins of the Invention 
(a) Therapeutic Methods 

The recombinant proteins of the invention may be used to 
specifically inhibit or destroy cancer cells that contain a protease that can 

15 cleave the linker sequence of the recombinant protein. It is an advantage 
of the recombinant proteins of the invention that they have specificity for 
the cancer cells without the need for a cell binding component. The ricin- 
like B chain of the recombinant proteins recognize galactose moieties on 
the cell surface and ensure that the protein is taken up by the cancer cell 

20 and released into the cytoplasm. When the protein is internalized into a 
normal cell cleavage of the heterologous linker would not occur in the 
absence of the cancer-specific protease and the A chain will remain inactive 
bound to the B chain. Conversely, when the protein is internalized into a 
cancer cell, the cancer-specific protease will cleave the cleavage recognition 

25 site in the linker thereby releasing the toxic A chain. 

Accordingly, the present invention provides a method of 
inhibiting or destroying cancer cells comprising contacting cancer cells with 
an effective amoimt of a recombinant protein or a nucleic acid molecule 
encoding a recombinant protein of the invention. The present invention 

30 also provides a method of treating a cancer comprising administering an 
effective amount of a recombinant protein or a nucleic acid molecule 
encoding a recombinant protein of the invention to an ardmal in need 
thereof. 
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The term "effective amoiint" as used herein means an 
amount effective, at dosages and for periods of time necessary to achieve 
the desired result. 

The term "animal" as used herein means any member of the 
animal kingdom including all mammals, birds, fish, reptiles and 
amphibians. Preferably, the animal to be treated is a mammal, more 
preferably a human. 

The specificity of a recombinant protein of the invention 
may be tested by treating the protein with the cancer-specific protease 
which is thought to be specific for the cleavage recognition site of the 
linker and assaying for cleavage products. Cancer-specific proteases may 
be isolated from cancer cells, or they may be prepared recombinantly, for 
example following the procedures in Darket et al. Q. BioL Chem. 254:2307- 
2312 (1988)). The cleavage products may be identified for example based 
on size, antigenicity or activity. The toxicity of the recombinant protein 
may be investigated by subjecting the cleavage products to an in vitro 
translation assay in cell lysates, for example using Brome Mosaic Virus 
mRNA as a template. Toxicity of the cleavage products may be 
determined using a ribosomal inactivation assay (Westby et al., 
Bioconjugate Chem. 3:377-382 (1992)). The effect of the cleavage products on 
protein synthesis may be measured in standardized assays of in vitro 
translation utilizing partially defined cell free systems composed for 
example of a reticulocyte lysate preparation as a sotirce of ribosomes and 
various essential cofactors, such as mRNA template and amino acids. Use 
of radiolabelled amino acids in the mixture allows quantitation of 
incorporation of free amino acid precursors into trichloroacetic acid 
precipitable proteins. Rabbit reticulocyte lysates may be conveniently used 
(O'Hare, FEBS Lett, 273:200-204 (1990)). 

The ability of the recombinant proteins of the invention to 
selectively inhibit or destroy cancer cells may be readily tested in vitro 
using cancer cell lines. The selective inhibitory effect of the recombinant 
proteins of the invention may be determined, for example, by 
demonstrating the selective inhibition cellular proliferation in cancer cells 
or infected cells. 
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Toxicity may also be measured based on cell viability, for 
example the viability of cancer and normal cell cultures exposed to the 
recombinant protein may be compared. Cell viability may be assessed by 
known techniques, such as trypan blue exclusion assays. 
5 In another example, a number of models may be used to 

test the cytotoxicity of recombinant proteins having a heterologous linker 
sequence containing a cleavage recognition site for a cancer associated 
matrix metalloprotease. Thompson, E.W. et al. (Breast Cancer Res, 
Treatment 31:357-370 (1994)) has described a model for the determination 

10 of invasiveness of human breast cancer cells in vitro by measuring tumour 
ceD-mediated proteolysis of extracellular matrix and tumour cell invasion 
of reconstituted basement membrane (collagen, laminin, fibronectin, 
Matrigel or gelatin). Other applicable cancer cell models include cultured 
ovarian adenocarcinoma, cells (Yoimg, T.N. et al. Gynecol. Oncol. 62:89-99 

15 (1996); Moore, D.H. et al. Gynecol Oncol 65:78-82 (1997)), human follicular 
thyroid cancer cells (Demeure, M.J. et al., Viorld /. Surg. 16:770-776 (1992)), 
human melanoma (A-2058) and fibrosarcoma. (HT-1080) cell lines 
(Mackay, A.R et al. La&. Invest 70:781 783 (1994)), and limg squamous (HS- 
24) cind adenocarcinoma (SB-3) cell lines (Spiess, E. et al. J. Histochem. 

20 Cytochem, 42:917-929 (1994)). An in vivo test system involving the 
implantation of tumours and measurement of tumour growth and 
metastasis in a thymic nude mice has also been described (Thompson, E.W. 
et al.. Breast Cancer Res. Treatment 31:357-370 (1994); Shi, Y.E. et al.. Cancer 
Res, 53:1409-1415 (1993)). 

25 Although the primary specificity of the proteins of the 

invention for cancer cells is mediated by the specific cleavage of the 
cleavage recognition site of the linker, it will be appreciated ttiat specific cell 
binding components may optionally be conjugated to the proteins of the 
invention. Such cell binding components may be expressed as fusion 

30 proteins with the proteins of the invention or the cell binding component 
may be physically or chemically coupled to the protein component. 
Examples of suitable cell binding components include antibodies to cancer 
proteins. 
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Antibodies having specificity for a cell surface protein may 
be prepared by conventional methods. A mammal, (e.g. a mouse, 
hamster, or rabbit) can be immimized with an immimogenic form of the 
peptide which elicits cin antibody response in the mammal. Techniques for 
5 conferring immunogenicity on a peptide include conjugation to carriers or 
other techniques well known in the art. For example, the peptide can be 
administered in the presence of adjuvant. The progress of immunization 
can be monitored by detection of antibody titers in plasma or serum. 
Standard ELISA or other immunoassay procedures can be used with the 
10 immimogen as antigen to assess the levels of antibodies. Following 
^ immunization, antisera can be obtained and, if desired, polyclonal 

□ antibodies isolated from the sera. 

^'"l! To produce monoclonal antibodies, antibody producing 

Si cells (Ijrmphocytes) can be harvested from an immimized cmimal and fused 

irU 15 with myeloma cells by standard somatic cell fusion procedures thus 

immortalizing these cells and yielding hybridoma cells. Such techniques 
S«i are well known in the art, (e.g. the hybridoma technique originally 

4- developed by Kohler and Milstein (Nature 256:495-497 (1975)) as well as 

^.p Other techniques such as the human B-cell hybridoma techruque (Kozbor 

g 20 et aL, ImmunoLToday 4:72 (1983)), the EBV-hybridoma technique to 

produce human monoclonal antibodies (Cole et al.. Monoclonal Antibodies 
in Cancer Therapy Allen R., Bliss, Inc., pages 77-96 (1985)), and screening of 
combinatorial antibody libraries (Huse et al.. Science 246:1275 (1989)). 
Hybridoma cells can be screened immunochemically for production of 
25 emtibodies specifically reactive with the peptide and the monoclonal 
antibodies can be isolated. 

The term "antibody" as used herein is intended to include 
fragments thereof which also specifically react with a cell surface 
component. Antibodies can be fragmented using conventional techniques 
30 and the fragments screened for utility in the same manner as described 
above. For example, F(ab')2 fragments can be generated by treating 
antibody with pepsin. The resulting F(ab')2 fragment can be treated to 
reduce disulfide bridges to produce Fab* fragments. 
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Chimeric antibody derivatives, i.e., antibody molecules that 
combine a non-human animal variable region and a human constant 
region are also contemplated within the scope of the invention. Chimeric 
antibody molecules can include, for example, the antigen binding domain 
5 from an antibody of a moxise, rat, or other species, with himian constant 
regions. Conventional methods may be used to make chimeric antibodies 
containing the immtmoglobulin variable region which recognizes a cell 
surface antigen (See, for example, Morrison et al., Proc. Natl Acad. Sci, 
US, A, 81:6851 (1985); Takeda et al.. Nature 314:452 (1985), Cabilly et al., 

10 U.S. Patent No. 4,816,567; Boss et al., U.S. Patent No. 4,816,397; Tanaguchi 
et al., E.P. Patent No. 171,496; European Patent No. 173,494, Uruted 
Kingdom Patent No. GB 2177096B). It is expected that chimeric antibodies 
would be less immimogenic in a human subject than the corresponding 
non-chimeric antibody. 

15 Monoclonal or chimeric antibodies specifically reactive 

against cell surface components can be further humanized by producing 
human constant region chimeras, in which parts of the variable regions, 
particularly the conserved framework regions of the antigen-binding 
domain, are of human origin and only the hypervariable regions are of 

20 non-human origin. Such immxmoglobulin molecxiles may be made by 
techniques known in the art, (e.g. Teng et al., Proc. Natl Acad. ScL U.S.A., 
80:7308-7312 (1983); Kozbor et al.. Immunology Today 4:7279 (1983); Olsson 
et al., Meth. Enzymol, 92:3-16 (1982), and PCT Publication W092/06193 or 
EP 239,400). Humanized antibodies can also be commercially produced 

25 (Scotgen Limited, 2 Holly Road, Twickenham, Middlesex, Great Britain.) 

Specific antibodies, or antibody fragments, reactive against 
cell surface components may also be generated by screening expression 
libraries encoding iromunoglobulin genes, or portions thereof, expressed 
in bacteria with cell surface components. For example, complete Fab 

30 fragments, VH regions and FV regions can be expressed in bacteria using 
phage expression libraries (See for example Ward et al.. Nature 341:544-546 
(1989); Huse et al.. Science 246:1275-1281 (1989); and McCafferty et al.. 
Nature 348:552-554 (1990)), Alternatively, a SCID-hu mouse, for example 
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the model developed by Genpharm, can be used to produce antibodies, or 

fragments thereof. 

(b) Pharmaceutical Compositions 

The proteins and nucleic acids of the invention may be 
5 formxxlated into pharmaceutical compositions for adminstration to subjects 
in a biologically compatible form suitable for administration in vivo. By 
"biologically compatible form suitable for administration in vivo" is meant 
a form of the substance to be administered in v^hich any toxic effects are 
outweighed by the therapeutic effects. The substances may be 

10 administered to living organisms including humans, and animals. 
Administration of a therapeutically active amovmt of the pharmaceutical 
compositions of the present invention is defined as an amoimt effective., at 
dosages and for periods of time necessary to achieve the desired result. 
For example, a therapeutically active amotmt of a substance may vary 

15 according to factors such as the disease state, age, sex, and weight of the 
individual, and the ability of antibody to elicit a desired response in the 
individual. Dosage regime may be adjusted to provide the optimum 
therapeutic response. For example, several divided doses may be 
administered daily or the dose may be proportionzdly reduced as indicated 

20 by the exigencies of the therapeutic situation. 

Accordingly, the present invention provides a 
pharmaceutical composition for treating cancer comprising a recombinant 
protein or a nucleic acid encoding a recombinant protein of the invention 
and a pharmaceutically acceptable carrier, diluent or excipient. 

25 The active substance may be administered in a convenient 

manner such as by injection (subcutaneous, intravenous, intramuscular, 
etc.), oral administration, inhalation, transdermal administration (such as 
topical crecun or ointment, etc.)/ or suppository applications. Depending 
on the route of administration, the active substance may be coated in a 

30 material to protect the compound from the action of enzymes, acids and 
other natural conditions which may inactivate the compotmd. 

The compositions described herein can be prepared by per 
se known methods for the preparation of pharmaceutically acceptable 
compositions which can be administered to subjects, such that an effective 
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quantity of the active substance is combined in a mixture with a 
pharmaceutically acceptable vehicle. Suitable vehicles are described, for 
example, in Remington's Pharmaceutical Sciences (Remington's 
Pharmaceutical Sciences, Mack Publishing Company, Easton, Pa., USA 
5 1985). On this basis, the compositions include, albeit not exclusively, 
solutions of the substances in association with one or more 
pharmaceutically acceptable vehicles or diluents, and contained in buffered 
solutions with a suitable pH and iso-osmotic with the physiological fluids. 

The pharmaceutical compositions may be used in methods 

10 for treating animals, including mammals, preferably humarxs, with cancer. 
It is anticipated that the compositions will be particularly useful for treating 
patients with B-cell lymphoproliferative disease and melanoma. The 
dosage and type of recombinant protein to be administered will depend on 
a variety of factors which may be readily monitored in hiunan subjects. 

15 Such factors include the etiology and severity (grade and stage) of the 
neoplasia. 

The following non-limiting examples are illustrative of the 
present invention: 

EXAMPLES 

20 EXAMPLE 1 

Cloning and Expression of Prorici n Variants Activated by Disease 
Specific Proteases 
Isolation of total RNA 

The preproricin gene was cloned from new foliage of the 
25 castor bean plant. Total messenger RNA was isolated according to 
established procedures (Sambrook et al.. Molecular Cloning: A Lab Manual 
(Cold Spring Harbour Press, Cold Spring Harbour, (1989)) and cDNA 
generated using reverse transcriptase. 
cDNA Synthesis 

30 Oligonucleotides, corresponding to the extreme 5' and 3' 

ends of the preproricin gene were synthesized and used to PCR amplify 
the gene. Using the cDNA sequence for preproricin (Lamb et al., Eur, J, 
Biochem., 145:266-270, 1985), several oligonucleotide primers were designed 
to flank the start and stop codons of the preproricin open reading frame. 
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The oligonucleotides were synthesized vising an Applied Biosystems Model 
392 DNA/RNA Synthesizer. First strand cDNA synthesis was primed 
using the oligonucleotide Ricinl729C. Three micrograms of total RNA was 
used as a template for oUgo Ridnl729C (5'-ATAACTTGCTGCTCCTTTCA- 
5 3') primed synthesis of cDNA using Superscript 11 Reverse Transcriptase 
(BRL) following the manufacturer's protocol. 
DNA Amplification and Cloning 

The first strand cDNA synthesis reaction was used as 
template for DNA amplification by the polymerase chain reaction (PCR). 

10 The preproricin cDNA was amplified using the upstream primer Ricin-99 
(S'-CCGGGAGGAAATACTATTGTAAT-SO and the downstream primer 
Ricinl729C with Vent DNA polymerase (New England Biolabs) using 
standard procedures (Sambrook et al.. Molecular Cloning: A Laboratory 
Manual, Second Edition, (Cold Spring Harbor Laboratory Press, 1989)). 

15 Amplification was carried out in a Biometra thermal cycler (TRIO- 
Thermalcycler) using the following cycling parameters: denaturation 95**C 
for 1 min., annealing 52°C for 1 min., and extension 72°C for 2 min., (33 
cycles), followed by a final extension cycle at 72°C for 10 min. The 1846 bp 
amplified product was fractionated on an agarose gel (Sambrook et aL, 

20 Molecular Cloning: A Laboratory Manual, Second Edition, (Cold Spring 
Harbor Laboratory Press, 1989), and the DNA purified from the gel slice 
using Qiaex resin (Qiagen) following the manufacturer"s protocol. The 
purified PCR fragment encoding the preproricin cDNA was then ligated 
(Sambrook et aL, Molecular Cloning: A Laboratory Manual, Second 

25 Edition, (Cold Spring Harbor Laboratory Press, 1989)) into an Eco RV 
digested pBluescript 11 SK plasmid (Stratagene), and used to transform 
competent XLl-Blue ceUs (Stratagene). Positive clones were confirmed by 
restriction digestion of purified plasmid DNA. Plasmid DNA was extracted 
using a Qiaprep Spin Plasmid Miniprep Kit (Qiagen). 

30 DNA Sequencing 

The cloned PCR product containing the putative preproricin 
gene (pAP144) was confirmed by DNA sequencing of the entire cDNA 
clone. Sequencing was performed using an Applied Biosystems 373A 
Automated DNA Sequencer, and confirmed by double-stranded dideoxy 
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sequencing by the Sanger method using the Sequenase kit (USB) (see WO 
98/49311). 

Prndtigtion and aoning of T.inker Variants 

pAP144 cut with EcoRl was used as target for PGR pairs 
5 employing the Ricinl09-Eco oligonucleotide (Ricin-109Eco primer: 5- 
GGAGGAATCCGGAGATGAAACCGGGAGGAAATACTATTGTAAT-3) 

and a mutagenic primer for the 5' half of the linker 
as well as the Ricinl729PstI primer (Ricin 1729-PstI: 5- 
GTAGGCGCTGCAGATAACTTGCTGTCCTrTCAG-3) and a mutagenic 
10 primer for the 3' half of the linker. The cycling conditions used for the 
PCRs were 98 degrees C for 2 min.; 98°C 1 min., 52^C 1 min., 72°C 1 min. 
15 sec. (30 cycles); 72 degrees C 10 min.; 4 degrees C soak. The PGR 
products were then digested by EcoRI and PstI respectively, 
electrophoresed on an agarose gel, and the bands purified by via glass 
15 wool spin columns. Triple ligatior\s comprising the PGR product pairs 
(corresponding halves of the new linker) and pVL1393 vector digested 
with EcoRl and Pstl were carried out. Recombinant clones were identified 
by restriction digests of plasmid miniprep DNA and the altered linkers 
confirmed by DNA sequencing. Note that all altered linker variants were 
20 cloned directly into the pVL1393 vector. 
Isolation of Recombinant Baculoviruses 

Insect cells S.frugiperda (Sf9), and Trichoplusia ni (Tn368 and 
BTI-TN-581-4 (High Five)) were maintained on EX-CELL 405 medium JRH 
Biosciences) supplemented with 10% total calf serum (Summers et al., A 
25 Manual of Methods of Baculovirus Vectors and Insect Cell Culture 
Procedures, (Texas Agricultural Experiment Station, 1987)). Two 
micrograms of recombinant pVL1393 DNA was co 

transfected with 0.5 microgram of BaculoGold AcNPV DNA (Pharmingen) 
into 2 X 10^ Tn368 insect cells following the manufacturer's protocol 
30 (Gruenwald et al., Baculovinis Expression Vector System: Procedures and 
Methods Manual, 2nd Edition, (San Diego, CA, 1993)). On day 5 post- 
transfection, media were centrifuged and the supematants tested in 
limiting dilution assays with Tn368 cells (Stimmers et al., A Manual of 
Methods of Baculovirus Vectors and Insect Cell Culture Procedures, (Texas 
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Agricultural Experiment Station, 1987)). Recombinant viruses in the 
supematants were then amplified by infecting Tn368 cells at a multiplicity 
of infection (moi) of 0.1, followed by collection of day 3 to 5 supematants. 
A total of three rounds of amplification were performed for each 
5 recombinant following established procedures (Summers et al., A Manual 
of Methods of Baculovirus Vectors and Insect Cell Culture Procedures, 
(Texas Agricultural Experiment Station, 1987 and Gruenwald et al., 
Baculovirus Expression Vector System: Procedures and Methods Manual, 
2nd Edition, (San Diego, CA, 1993)). 
10 Expression of Mutan t Proricin 

Recombinant baculoviruses were used to infect 1X10^ Tn368 
or sf9 cells at an moi of 9 in EX-CELL 405 media (JRH Biosciences) with 
25mM a-lactose in spinner flasks. Media supematants containing mutant 
proricins were collected 3 or 4 days post-infection. 
15 EXAMrifE2 

Harvesting and affinity c o lumn purification of pro-ricin variants 

Protein samples were harvested three days post infection. 
The cells were removed by centrifuging the media at 8288 g for ten 
minutes using a GS3 (Sorvall) centrifuge rotor. The supernatant was 
20 further clarified by centrifuging at 25400 g using a SLA-1500 rotor (Sorvall) 
for 45 nunutes. Protease inhibitor phenylmethylsulfonyl fluoride (Sigma) 
was slowly added to a final concentration of ImM. The samples were 
further prepared by adding a-lactose to a concentration of 20 mM (not 
including the previous lactose contained in the expression medium). The 
25 samples were concentrated to 700 mL using a Prep/Scale-TFF Cartridge 
(2.5ft, lOK regenerated cellulose (Millipore)) and a Masterflex pump. The 
samples were then dialysed for 2 days in IX Column Btiffer (50 mM Tris, 
100 mM NaCl, 0.02% NaN3, pH 7.5) using dialysis tubing (10 K MWCO, 32 
mm flat width(Spectra/Por)). Subsequently, the samples were clarified by 
30 centrifuging at 25400 g using a SLA-1500 rotor (SorvaU) for 45 minutes. 

Following centrifugation, the samples were degassed and 
applied at 4 degrees C to a XK26/20 (Pharmacia) column (attached to a 
Pharmacia peristaltic pump, Pharmacia Single-path Monitor UV-1 Control 
and Optical Units, and Bromma LKB 2210 2-Channel Recorder) containing 
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20 mL a-Lactose Agarose Resin (Sigma). The column was washed for 3 
hours with IX Column buffer. Elution of pro-ricin variant was performed 
by eluting with buffer (IX Column buffer (0.1% NaN3), 100 mM Lactose) 
until the baseline was again restored. The samples were concentrated 
5 using an Amicon 8050 concentrator (Amicon) with a YMIO 76 mm 
membrane, utilizing argon gas to pressurize the chamber. The samples 
were further concentrated in Centricon 10 (Millipore) concentrators 
according to manufacturer's specifications. 

P^irifiration of Variant PAP-Pro tpm by gel filtration chromatpgyaphy 
10 In order to purify variant from processed material 

produced diuing fermentation, the protein was applied to a SUPERDEX 75 
(16/60) column and SUPERDEX 200 (16/60) column (Pharmacia) connected 
in series equilibrated with 100 mM Tris, 200 mM NaCl, pH 7.5 containing 
100 mM lactose and 1.0% p-mercaptoethanol (PME). The flow rate of the 
15 column was 0.15 mL/min and fractions were collected every 25 minutes. 
The UV (280 nm) trace was used to determine the approximate location of 
the purified PAP-protein and thus determine the samples for Western 
aneilysis. 

Western analysis of colum n fractions 
20 Fractions eluted from the SUPERDEX coltunns (Pharaiacia) 

were analyzed for purity using standard Western blotting techniques. An 
aliquot of 10 jiL from each fraction was boiled in IX sample buffer (62.6 
mM Tris-Cl, pH 6.8, 4.4% pME, 2% sodiiim dodecyl sulfate (SDS), 5% 
glycerol (all from Sigma) and 0.002% bromophenol blue (Biorad)) for five 
25 minutes. Denatured samples were loaded on 12% Tris-Glycine Gels 
(Biorad) along with 50 ng of RCA^o (Sigma) and 5 [iL of kaleidoscope 
prestained standards (Biorad). Electrophoresis was carried out for ninety 
minutes at lOOV in 25 mM Tris-Cl, pH 8.3, 0.1% SDS, and 192 mM glycine 
using the BioRad Mini Protean n cells (Biorad). 
30 Following electrophoresis gels were equilibrated in transfer 

buffer (48 mM Tris, 39 mM glycine, 0.0375% SDS, and 20% Methanol) for a 
few minutes. PVDF Biorad membrane was presoaked for one minute in 
100% methanol, rinsed in ddH20 and two minutes in transfer buffer. 
Whatman paper was soaked briefly in transfer buffer. Five pieces of 
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Whatman paper, membrane, gel, and another five pieces of Whatman 
paper were arranged on the bottom cathode (anode) of the Pharmacia 
Novablot transfer apparatus (Pharmacia). Trai\sfer was for one hour at 
constant current (2 mA/CM^). 

5 Transfer was confirmed by checking for the appearance of 

the prestained standards on the membrane. Non-specific sites on the 
membrane were blocked by incubating the blot for thirty minutes in IX 
Phosphate Buffered Saline (IX PBS; 137 mM NaCl, 2.7 mM KCl, 8 mM 
Na2HP04,1.5 mM KH2PO4, pH 7.4) with 5% skim milk powder Carnation). 

10 Primary antibody rabbit anti-ricin, (Sigma) was diluted 1:3000 in IX PBS 
containing 0.1% Tween 20 (Sigma) and 2.5% skim milk and incubated with 
blot for forty five minutes on a orbital shaker (VWR). Non-specifically 
bound primary antibody was removed by washing the blot for ten 
minutes with IX PBS containing 0.2% Tween 20. This was repeated four 

15 times. Secondary antibody doitkey anti-rabbit (Amersham) was incubated 
with the blot imder the same conditions as the primary antibody. Excess 
secondary antibody was washed as described above. Blots were 
developed with the ECL Western Blotting detection reagents according to 
the manufacturer's instructions. Blots were exposed to Medtec's Full Speed 

20 Blue Film (Medtec) or Amersham's ECL Hyperfilm (Amersham) for one 
second to five minutes. Film was developed in a KODAK Automatic 
Developer. 

Determination of lectin binding ability of pro«ricin variant 

An Immulon 2 plate (VDVR) was coated with 100 ^1 per 

25 well of 10 |xg/ml of asialofetuin and left overnight at 4*'C. The plate was 
washed with 3X 300 jiL per well with ddH20 xising an automated plate 
washer (BioRad). The plate was blocked for one hour at 37**C by adding 
300 \iL per well of PBS containing 1% ovalbumin. The plate was washed 
again as above. Pro-ricin variant PAP-protetn was added to the plate in 

30 various dilutions in IX Column Buffer, (50 mM Tris, 100 mM NaCl, pH 7.5). 
A standard curve of RCA^q (Sigma) from 1-10 ng was also included. The 
plate was incubated for 1 h at 37^C. The plate was washed as above. Anti- 
ricin monoclonal antibody (Sigma) was diluted 1:3000 in IX PBS containing 
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0.5% ovalbumin and 0.1% Tween-20, added at 100 |iL per well and 
incubated for 1 h at 37°C. The plate was washed as above. Donkey-anti 
rabbity polyclonal antibody was diluted 1:3000 in IX PBS containing 0.5% 
ovalbumin, 0.1% Tween-20, and added at 100 \ih per well and incubated 
for 1 h at 37'*C. The plate was given a final wash as described above. 
Substrate was added to plate at 100 \iL per well (1 mg/mL o- 
phenylenediamine (in HjO), 1 HL/mL H2O2) and after development 25 |J,L 
of stop solution (20% H2SO4) was added and the absorbance read 
(A490nm-A630nm) using a SPECTRA MAX 340 plate reader (Molecular 
Devices). 

Petermination of PAP ^Protein acti vity using the rabbit reticulocyte assay 
Ricin samples were prepared for reduction. 

A) RCA60 = 3,500 ng/\xL of RCAgo + 997 jiL Ix Endo buffer (25 
mM Tris, 25 mM KCl, 5mM MGCI2, pH 7.6) 

Reduction = 95 |iiL of 10 ng/^L + 5 p-mercaptoethanol 

B) Ricin variants 

Reduction = 40 jiL variant + 2 ^iL p-mercaptoethanol 

The ricin standard and the variants were incubated for 30 

minutes at room temperature. 
Ricin - Rabbit Reticulocyte lysate reaction 

The required number of 0.5 mL tubes were labelled. (2 25 
tubes for each sample, + and - aniline). To each of the sample tubes 20 \iL 
of IX endo buffer was added, and 30 |aL of buffer was added to the 
controls. To the sample tubes either 10 \iL of lOng/^iL, Ricin or 10 ^lL of 
variant was added. Finally, 30 iiL of rabbit reticulocyte lysate was added to 
all the tubes- The samples were incubated for 30 minutes at 30°C using the 
thermal block. Samples were removed from the 0.5 mL tube and contents 
added into a 1.5 mL tube containing 1 mL of TRIZOL (Gibco). Samples 
were incubated for 15 minutes at room temperattire. After the incubation, 
200 ^iL of chloroform was added, and the sample was vortexed and spim at 
12,000 g for 15 minutes at 4''C. The top aqueovis layer from the samples 
was removed and contents added to a 1 mL tube containing 500 ^iL of 
isopropanol. Samples were incubated for 15 minutes at room temperature 
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and then centrifuged at 12,000 for 15 minutes at 4°C. Supernatant was 
removed and the pellets were washed with 1 mL of 70% ethanol. 
Centrifugation at 12,000 g for 5 minutes at 4°C pelleted the RNA. All but 
approximately 20 \lL of the supernatant was removed and the RNA pellet 
5 was allowd to air dry. Pellets from the other samples (^aniline samples) 
were dissolved in 20 \ih of DEPC treated ddHjO. An 80 \ih aliquot of 1 M 
aniline (distilled) with 2.8 M acetic acid was added to these RNA samples 
and transferred to a fresh 0.5 mL tube. The samples were incubated in the 
dark for 3 minutes at 60°C. RNA was precipitated by adding 100 |lU, of 95% 
10 ethanol and 5 M.L of 3M sodium acetate, pH 5.2 to each tube and 
centrifuging at 12,000 g for 30 minutes at 4°C. Pellets were washed with 1 
mL 70% ethanol and centrifuged again at 12,000g for 5 minutes at 4°C to 
precipitate RNA, The supernatant was removed and air dried. These 
pellets were dissolved in 10 |xL of 0.1 X E buffer. To all samples, 10 jiL of 
15 formamide loading dye was added. The RNA ladder (BRL) (8 jiL of ladder 
+ 8 |xL of loading dye) was also included. Samples were incubated for 2 
minutes at 70°C on the thermal block. Electrophoresis was carried out on 
the samples using 1.2% agarose, 50% formamide gels in O.IX E buffer + 
0.2% SDS. The gel was run for 90 minutes at 75 volts. RNA was visualized 
20 by staining the gel in 1 \ig/\iL ethidium bromide in nmning buffer for 45 
minutes. The gel was examined on a 302 nm UV box, photographed using 
the gel documentation system and saved to a computer disk. 
Results: 

Protein Expression Yields 
25 Aliquots were taken at each stop of the harvesting/ 

purification and tested. Yields of functional ricin variant were determined 
by ELISA. Typical resxilts of an 2400 mL prep of infected T. ni cells are 
given below. 

Aliquot ug PAP 220 

30 Before concentration and dialysis 6000 
after concentration and dialysis 4931 
alpha- Lactose agarose column flow through 219 
alpha- Lactose agarose colvmm elution 1058 
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Yield: 1058/6000 = 17.6% 

Purification of PAP*Piotein and Western Analysis of colmnn fractions 

Partially purified PAP-protein was applied to Superdex 75 
5 and 200 (16/60) columns connected in series in order to remove the 
contaminating non-specifically processed PAP-protein. Eluted fractions 
were tested via Western analysis as described above and the fractions 
containing the most pure protein were pooled, concentrated and dialyzed 
against 1 X PBS buffer and then sterilized by filtration (Millipore). Final 
10 purified PAP-protein has less than 1% processed variant. 

The purified PAP-protein was tested for susceptibility to 
Cj cleavage by the particular protease and for activation of the A-chain of the 

|T pro-ricin variant, (inhibition of protein sjjmthesis). Typically, PAP protein 

M was incubated with and without protease for a specified time period and 

[»5 15 then electrophoresed and blotted. Cleaved PAP will run as two 30 kDa 

proteins (B is slightly larger) \mder reducing (SDS-PAGE) conditions, 
p Unprocessed PAP-protein, which contains the linker region, will migrate at 

60 kDa. 

Activation of PAP -Protein variant with Specific Protease 
y 20 Activation of protease treated PAP-protein is based on the 

method of May et al (EMBO Journal. S 301-8, 1989). Activation of ricin A 
chain upon cleavage of the intermediary linker results in catalytic 
depurination of the adenosine 4325 residue of 28S or 26S rRNA. This 
depurination renders the molecule susceptible to amine-catalyzed 

25 hydrolysis by aniline of the phosphodiester bond on either side of the 
modification site. The result is a diagnostic 390 base band. As such, 
reticulocyte ribosomes incubated with biochemically purified ricin A chain, 
released the characteristic RNA fragment upon aniline treatment of 
isolated rRNA (May, M.J. et al. Embo. Journal, 8:301-308 at 302- 303 (1989)). 

30 It is on this basis that the assay allows for the determination of activity of a 
ricin A chain which has been cleaved from the intact unit containing a 
particular variant linker sequence. 
EXAMPLES 

In Vitro Protease Digestion of Proricin Variants: 
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Affinity-purified proricin variant is treated with individual 
disease-specific proteases to confirm specific cleavage in the linker region. 
Ricin- like toxin variants are eluted from the lactose-agcirose matrix in 
protease digestion buffer (50mM NaCl, 50mM Na-acetate, pH 5.5, ImM 
dithiothreitol) containing lOOmM lactose. Proricin substrate is then 
incubated at 37°C for 60 minutes with a disease-specific protease. The 
cleavage products consisting ricin A and B chains are identified using 
SDS/PAGE (Sambrook et aL, Molecular Cloning: a Laboratory Manual, 
2nd. ed.. Cold Spring Harbor Press, 1989), followed by Western blot 
analysis using anti- ricin antibodies (Sigma). 

Matrix metalloproteinases may be prepared substantially as 
described by Lark, M.W. et aL (Proceedings of the 4th International Conference 
of the Imflammation Research Association Abstract 145 (1988)) and Welch, A.R. 
et al. (Arch. Biochem. Biophys. 324:59-64 (1995)). 

Urokinase plasminogen activator may be prepared 
substantially as described by Holmberg, L. et al. (Biochim Biophys Acta, 
445:215-222, (1976)) and Someno, T. et al. Q Biochem 97:1493-1500 (1985)). 
EXAMPLE 4 

Cytotoxicity of Ricin and Ricin Variants o n Cell Lines 
Cell Lines 

CQS-I (African Green Monkey Kidney Cells) 

This is an SV40 transformed cell line which was prepared from established 
simian cells CV-1. (Reference: Gltizman, Y. (1975) Cell, 23, 175 - 182)(ATCC 
CRL 1650) 

HT*1080 Human Fibrosarcoma 

(ATCC CCL 121) This cell line was shown to produce active 
MMP-9 in tissue culture. References: Moore et al. (1997) Gynecologic 
Oncology 65, 83-88. 
Cell Preparation 

After washing with IXPBS (0.137 M NaCl, 2.68 mM KCI, 
8.10 mM Na2HP04, 1.47 mM KH2PO4), cells in log phase growth were 
removed from plates with IX trypsin/EDTA (Gibco/BRL). The cells were 
centrifuged at 1100 rpm for 3 min, resuspended in Dulbecco's Modified 
Eagle Medium containing 10%FBS and IX pen/ strep, and then counted 
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using a haemocytometer. They were adjusted to a concentration of 5 X 10^ 
cells •ml-i. One hundred microliters per well of cells was added to wells 2B 
- 2G through to wells 9B - 9G of a Falcon 96 well tissue culture plate. A 
separate 96 well tissue culture plate was used for each sample of Ricin or 

5 Ricin variant. The plates were incubated at 37^C witfi 5% CO2 for 24 hours. 
Toxin Preparation 

The Ricin and Ricin variants were sterile filtered using a 0.22 
\im filter (Millipore). The concentration of the sterile samples were then 
quantified by A280 and confirmed by BCA measurements (Pierce). For the 

10 variants digested with the MMP-9 protease in vitro, the digests were 
carried out as described in the digestion procedure for each protease. The 
digests were then diluted in the 1000 ng*ml-^ dilution and sterile filtered. 
Ricin and Ricin variants were serially diluted to the following 
concentrations: 1000 ng*ml-^ 100 ng^ml'i, 10 ng*mH, 1 ng»ml-^ 0.1 

15 ng»ml-i, 0.01 ng^ml-^ 0.001 ng^ml*^ with media containing 10%FBS and 
IX pen/strep. 

A pplication of Toxin or Varia nts to Plates 

Columns 2 to 9 were labeled: control, 1000 ng»ml-^, 100 
ng^ml-i, 10 ng^ml-i, 1 ng*mH, 0.1 ng*ml-i, 0.01 ng*mH, 0.001 ng*ml-i 

20 consecutively. The media was removed from all the sample wells with a 
multichannel pipettor. For each plate of variant and toxin, 50 jil of media 
was added to wells 2B to 2G as the control, and 50 ^1 of each sample 
dilution was added to the corresponding columns. The plates were 
incubated for one hour at 37^C with 5% CO2, then washed once and 

25 replaced with media, then incubated for 48 hours at 37°C with 5% CO2. 

Sample Application 

The whole amoimt of media (and/or toxin)was removed 
from each well with a multichaimel pipettor, and replaced with 100 \i\ of 
the substrate mixture (Promega Cell Titer 96 Aqueous Non-Radioactive 
30 Cell Proliferation Assay Kit). The plates were incubated, at 37°C with 5% 
CO2 for 2 to 4 hours, and subsequently read with a Spectramax 340 96 well 
plate reader at 490nm. The IC50 values were calculated using the GRAFIT 
software program. 



•47- 



Results 

The results of the cytotoxicity assay are shown in Tables 1 to 
4. In almost all cases the novel variants show preferential activation in the 
tumoiir cell line HT1080 (human fibrosarcoma) as compared with the non- 
tirmouTOgenic cell line COS-1 (immortalized cell line form the kidney of an 
African green monkey). 
EXAMPLE 5 

Maximum Tolerable Dose Data - PAF304 and PAP305 

The protocol for the maximum tolerable dose (MTD) study 
involved a single intravenoiis injection of either PAP304 or PAP305 into the 
tail vein of either a Nude/SCID mouse. Three animals were used for each 
dose tested. The samples were diluted into saline solution contaiiung 100 
\ig/mL Bovine Serum Albumin on the same day as the injection. Animals 
were observed for 14 days after dosing. Any surviving animals were 
euthanized after 14 days of study. The MTD value was defined as the 
highest dose of sample tested where all animals in the group survived. 
PAP304 - 150 Jig/kg 
PAP305 - 25 jig/kg 

(c/ Ricin - 1.6 jig/kg and PAP220 r, 13 ^g/kg) 

These results demonstrate that the pAP304 and pAP305 linkers decrease 
the toxicity of the recombinant proteins. 
EXAMPLE 6 
In vivo Studies 

(a) Protocol for A431 Animal Model Studies 

Tumour growth will be monitored daily by measuring 
tumour dimensions with calipers. The treatment initiation date is 
dependent on the rate of timiour growth. When four groups (4 mice per 
group) of mice develop timiours of the desired size (50 MM^ - 100 MM^) 
the mice will be weighed and the treatment will be initiated. This 
treatment initiation date is considered as day 1, and the mice will be given 
a bolus intravenous injection of PAP304 on this day. Injections will be 
administered through the lateral tail vein. The treatment groups will be as 
shovm in Table 5. 
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All samples and buffer will be made up in saline solution 
containing 100 ^g/mL Bovine Serum Albumin, 
(b) In Vivo Efficacy Studies 

Subcutaneous A431 tumours were established in SCID mice. 
The tumours were treated with either PAP304 or PAP305 when the 
tumours reached 50 MM^ on Days 1, 5 and 9. The results shown in Figiures 
21 and 22 demonstrate that the linker decreases the toxicity of the variant 
(as compared with ricin) and the variants PAP304 and PAP305 are activated 
at or near the A431 (human epithelial carcinoma) solid tumour in mice. A 
very exciting result is shown in Figxire 21. In this study, the variant 
PAP304 was able to slow down the growth of A431 solid tumour (17 day 
delay), without any signs of dose limiting toxicity (e.g., no weight loss or 
death). 

Having illustrated and described the principles of the 
invention in a preferred embodiment, it should be appreciated to those 
skilled in the art that the invention can be modified in arrangement and 
detail without departure from such principles. We claim all modifications 
coming within the scope of the following claims. 

All publications, patents and patent applications referred to 
herein are incorporated by reference in their entirety to the same extent as 
if each individual publication, patent or patent application was specifically 
and individually indicated to be incorporated by reference in its entirety- 
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protein-3 (IGFBP-3), IGF, and lGFBP-3 protease activity in cerebrospinal 
fluid of children with leukemia, central nervous system tumor, or 
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1) Cytotoxicity of Selected Variants 



Table 1: Selected Variants against COS-1 Cells - Target Protease MMP-9 





Ricin 


PAP220 


PAP301 


PAP302 


PAP303 


PAP304 


PAP305 


PAP308 


Linker 
Length 
(residues) 




23 


23 


16 


15 


8 


12 


12 


Reduction 
in toxicity 
relative to 
Ricin 


IX 


23X 


24X 


118X 


63X 


1220X 


145X 


89X 



Table 2: Selected Variants against HT1080 Cells - Target Protease MMP-9 





Ricin 


PAP220 


PAP301 


PAP302 


PAP303 


PAP304 


PAP305 


PAP308 


Linker 
Length 
(residues) 




23 


23 


16 


15 


8 


12 


12 


Reduction 
in toxicity 
relative to 
Ricin 


IX 


4X 


5X 


24X 


12X 


137X 


38X 


21X 
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2) Cytotoxicity Data from Selected Variants 

Table 3: Selected Variants against COS-1 Cells 





Ricin 


PAP313 
a) 


PAP314 
a) 


PAP315 
a) 


PAP316 
b) 


PAP317 


PAP318 
b) 


PAP319 


Linker 
Tjengrt\ 
(residues) 




7 


15 


14 


23 


21 


22 


23 


Redviction 
in toxicity 
relative to 
Ricin 


IX 


72X 


52X 


75X 




55X 


lOOX 


155X 






Ricin 


PAP320 


PAP321 


PAP322 


PAP323 


PAP324 


PAP325 




Linker 
Length 
(residues) 








9 a) 


21 b) 


19 b) 


37 b) 




Reduction 
in toxicity 
relative to 
Ricin 


IX 






82X 


65X 


67X 

- 


82X 





a ) Target protease Urokinase Plasminogen Activator (UP A) 
b) Target protease MMP-9 
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Table 4: Selected Variants against HT1080 Cells 





Rici 
n 


PAP3 
13a) 


PAP3 
14a) 


PAP3 
15a) 


PAPS 
16b) 


PAPS 
17 


PAPS 
18b) 


PAPS 
19 


Linker 

Length 
(residues) 




7 


15 


14 


23 


21 


22 


23 


Reduction 
in toxicity 
relative to 
Ricin 


IX 


161X 


27X 


18X 




9X 


51X 


49X 










Ricin 


PAP320 


PAP321 


PAP322 
a) 


PAP323 
b) 


PAP324 
b) 


PAP325 
b) 




Linker 
Length 
(residues) 








9 


21 


19 


17 




Reduction 
in toxicity 
relative to 
Ricin 


IX 






51X 


15X 


14X 


20X 





a ) Target protease Urokinase Plasminogen Activator (UPA) 

b) Target protease MMP9 
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Table 5 



Group 


Sample 


Drug Dose 


Treatment (days) 


1 


Control - Buffer 


0 


1,5, and 9 


2 


PAP304 


75 


1, 5, and 9 


3 


PAP304 


100 


1,5, and 9 


4 


PAP304 


150 


1,5, and 9 



m 

'J 
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WE CLAIM; 

1. A purified and isolated nucleic acid molecule comprising (a) 
a nucleotide sequence encoding an A chain of a ricin-like toxin, (b) a 
nucleotide sequence encoding a B chain of a ricin-like toxin and (c) a 

5 nucleotide sequence encoding a heterologous linker amino acid sequence 
linking the A and B chains, the heterologous linker sequence containing a 
cleavage recognition site for a cancer-specific protease. 

2. A nucleic acid molecule of claim 1 wherein the cleavage 
recognition site is recognized by a cancer-associated protease which is 

10 selected from the group consisting of a matrix metalloproteinase and a 
urokinase-type plasminogen activator. 

3. A nucleic acid molecule of claim 1 wherein the A chain is 
ricin A chain, abrin toxin A chain, diphtheria toxin A chain, or Domain I of 
Pseudomonas exotoxiru 

15 4. A nucleic acid molecule of claim 1 wherein the A chain is 

volkensin toxin A chain, cholera toxin A chain, modeccin toxin A chain, 
viscumin toxin A chain or shiga toxin A chain. 

5. A nucleic acid molecule of claim 1 wherein the B chain is 
ricin B chain, abrin toxin B chain, diphtheria toxin B chain, or Domain II of 

20 Pseudomonas exotoxin. 

6. A nucleic acid molecule of claim 1 wherein the B chain is 
volkensin toxin B chain, cholera toxin B chain, modeccin toxin B chain, 
viscumin toxin B chain or shiga toxin B chain. 

7. A nucleic acid molecule according to any one of claims 1 to 6 
25 having a nucleic acid sequence selected from the group consisting of the 

nucleic acid sequence of pAP301 as shown in Figure IB; the nucleic acid 
sequence of pAP302 as shown in Figure 2B; the nucleic acid sequence of 
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pAP303 as shown in Figure 3B; the nucleic acid sequence of pAP304 as 
shown in Figure 4B; the nucleic acid sequence of pAP305 as shown in 
Figure 5B; the nucleic add 5 sequence of p AP308 as shown in Figure 6B; the 
nucleic acid sequence of pAP309 as shown in Figure 7B; the nucleic add 
sequence of pAP313 as shown in Figure 8B; the nucleic add sequence of 
p APS 14 as shown in Figure 9B; the nucleic acid sequence of pAP315 as 
shown in Figure lOB; the nucleic acid sequence of pAP316 as shown in 
Figure IIB; the nucleic acid sequence of pAP317 as shown in Figure 12B; 
the nucleic acid sequence of pAP318 as shown in Figure 13B; the nucleic 
add sequence of pAP319 as shown in Figure 14B; the nudeic acid sequence 
of pAP320 as shown in Figure 15B; the nucleic acid sequence of pAP321 as 
shown in Figure 16B; the nucleic acid sequence of pAP322 as shown in 
Figure 17B; the nucleic add sequence of pAP323 as shown in Figure 18B; 
the nucleic add sequence of pAP324 as shown in Figure 19B; and the 
nudeic acid sequence of p AP325 as shown in Figure 20B. 

8- A nudeic acid molecule according to any one of daims 1 to 7 

wherein (c) the nucleotide sequence of the linker is selected from the 

group consisting of: the nucleic acid sequence of pAP301 as shown in 
Figure 1 A; the nucleic acid sequence of pAP302 as shown in Figure 2A; the 
nudeic acid sequence of pAP303 as shown in Figure 3A; the nucleic acid 
sequence of pAF304 as shown in Figure 4A; the nucleic add sequence of 
pAF305 as shown in Figure 5A; the nucleic acid sequence of pAP308 as 
shown in Figure 6A; the nucleic acid sequence of pAP309 as shown in 
Figure 7A; tfie nudeic acid sequence of pAP313 as shown in Figure 8A; the 
nudeic acid sequence of pAP314 as shown in Figure 9A; the nucleic acid 
sequence of pAP315 as shown in Figure lOA; tiie nudeic add sequence of 
pAP316 as shown in Figure 11 A; the nucleic acid sequence of pAP317 as 
shown in Figure 12A; the nucleic acid sequence of pAP318 as shown in 
Figure 13A; the nucleic acid sequence of pAP319 as shown in Figure 14A; 
the nucleic acid sequence of pAP320 as shown in Figure 15A; the nucleic 
add sequence of pAP321 as shown in Figure 16A; the nudeic add sequence 
of pAP322 as shown in Figure 17A; the nudeic add sequence of pAP323 as 
shown in Figure 18 A; the nucleic acid sequence of pAP324 as shown in 
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Figure 19A; and the nucleic acid sequence of pAP325 as shown in Figure 
20A. 

9. A plasmid incorporating the nucleic acid molecule of any 
one of claims 1 to 8. 

10. A baculovirus transfer vector incorporating the nucleic acid 
molecule according to any one of claims 1 to 8- 

11- A recombinant protein comprising an A chain of a ricin-like 

toxin, a B chain of a ricin- like toxin and a heterologous linker amino acid 
sequence, linking the A and B chains, wherein the linker sequence contains 
a cleavage recognition site for a disease-specific protease. 

12. A recombinant protein of claim 11 wherein the A chain is 
ricin A chain, abrin toxin B chain, diphtheria toxin A chain, or Domain I of 
Pseudomonas exotoxin. 

13. A recombinant protein of claim 11 wherein the A chain is 
volkensin toxin A chain, cholera toxin A chain, modeccin toxin A chain, 
viscumin toxin A chain, or shiga toxin A chain. 

14. A recombinant protein of claim 11 wherein the B chain is 
ricin B chain, abrrn toxin B chain, diphtheria toxin B chain, or Domain n of 
Pseudomonas exotoxin. 

15. A recombinant protein of claim 11 wherein the B chain is 
volkensin toxin B chain, cholera toxin B chain, modeccin toxin B chain, 
viscumin toxin B chain, or shiga toxin B chain. 

16. A recombinant protein of claim 11 wherein the cleavage 
recognition site is recognized by a cancer-associated protease selected from 
the group consisting of a matrix metalloproteinase and a urokinase-t5^e 
plasminogen activator. 
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17. A recombinant protein of claim 11 wherein the linker amino 

acid sequence is selected from the group consisting of: the amino acid 
sequence of PAP301 as shown in Figure IC; the amino acid sequence of 
PAP302 as shown in Figure 2C; the amino acid sequence of PAP303 as 
5 shown in Figure 3C; the amino acid sequence of PAP304 as shown in 
Figure 4C; the amino acid sequence of PAP305 as shown in Figure 5C; the 
amino acid sequence of PAP308 as shown in Figure 6C; the amino acid 
sequence of PAP309 as shown in Figure 7C; the amino acid sequence of 
PAP313 as shown in Figure 8C; the amino acid sequence of PAP314 as 

10 shown in Figure 9C; the amino acid sequence of PAP315 as shown in 
Figure IOC; the amino acid sequence of PAP316 as shown in Figure IIC; 
the amino acid sequence of PAP317 as shown in Figure 12C; the amino acid 
sequence of PAP318 as shown in Figure 13C; the amino acid sequence of 
PAP319 as shown in Figure 14C; the amino acid sequence of PAP320 as 

15 shown in Figure 15C; the amino acid sequence of PAP321 as shown in 
Figure 16C; the amino acid sequence of PAP322 as shown in Figure 17C; 
the amino acid sequence of PAP323 as shown in Figxure 18C; the amino acid 
sequence of PAP324 as shown in Figure 19C; and the amino add sequence 
of PAP325 as shown in Figure 20C. 

20 18. A method of inhibiting or destroying cancer cells, which 

cells are associated with a protease specific to the cancer comprising the 
steps of: 

(a) preparing a purified and isolated nucleic acid having a 
nucleotide sequence encoding an A chain of a ridn-like toxin, a B chain of a 

25 ridn-like toxin, and a heterologous linker amino add sequence/ linking the 
A and B chains, wherein the linker sequence contains a cleavage 
recognition site for the proteeise; 

(b) introducing the nucleic acid into a host cell and 
expressing the nucleic acid in the host cell to obtain a recombinant protein 

30 comprising an A chain of a ricin-like toxin, a B chain of a ricin-like toxin and 
a linker amino add sequence; 
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(c) suspending the protein in a pharmaceutically acceptable 
carrier, diluent or excipient, and 

(d) contacting the cells with the recombinant protein. 

19. A method of inhibiting or destroying cancer cells 
comprising contacting the cells with an effective amoimt a recombinant 
protein according to any one of claims 11 to 17. 

20. A method of treating a cancer comprising administering an 
effective amoxmt of a recombinant protein according to any one of claims 
11 to 17 to an animal in need thereof. 

21. A method of treating a cancer comprising administering an 
effective amount of a nucleic acid molecule according to any one of claims 
1 to 7 to an animal in need thereof. 

22. A process for preparing a pharmaceutical for treating cancer 
comprising the steps of : 

(a) preparing a purified and isolated nucleic acid having a 
nucleotide sequence encoding an A chain of a ricin-like toxin, a B chain of a 
ricin-like toxin, and a heterologous linker amino add sequence, linking the 
A and B chains, wherein the linker sequence contains a cleavage 
recognition site for a cancer; 

(b) introducing the nucleic acid into a host cell and 
expressing the nucleic acid in the host cell to obtain a recombinant protein 
comprising an A chain of a ricin-like toxin, a B chain of a ridn-like toxin and 
a linker amino add sequence; 

(c) suspending the protein in a pharmaceutically acceptable 
carrier, diluent or exdpient. 

23. A pharmaceutical composition for treating cancer 
comprising a recombinant protein of any one of claims 11 to 17 and a 
pharmaceutically acceptable carrier, diluent or excipient. 



< 



-59- 

24. A pharmaceutical composition for treating cancer 

comprising a nucleic acid molecule of any one of claims 1 to 7 and a 
pharmaceutically acceptable carrier, diluent or excipient. 



5=5 =i 
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25. A purified and isolated nucleic acid molecule having a 
5 nucleic acid sequence selected from the group consisting of: the nucleic 

acid sequence of pAP301 as shown in Figure 1 A; the nucleic acid sequence 
of pAP302 as shown in Figure 2A; the nucleic acid sequence of pAP303 as 
shown in Figure 3A; the nucleic acid sequence of pAP304 as shown in 
Figure 4A; the nucleic add sequence of p AP305 as shown in Figure 5 A; the 
10 nucleic acid sequence of pAP308 as shown in Figure 6A; the nucleic acid 
sequence of pAP309 as shown in Figure 7A; the nucleic acid sequence of 
pAP313 as shown in Figure 8A; tiie nucleic acid sequence of pAP314 as 
shown in Figure 9A; the nucleic acid sequence of pAF315 as shown in 
Figure lOA; the nucleic acid sequence of pAP316 as shown in Figure 11 A; 
15 the nucleic acid sequence of pAP317 as shown in Figure 12A; the nucleic 
^ acid sequence of pAP318 as shown in Figure 13A; the nucleic acid sequence 

of pAP319 as shown in Figure 14A; the nucleic acid sequence of pAP320 as 
% shown in Figure 15A; the nucleic acid sequence of pAP321 as shown in 

□ Figure 16 A; the nucleic acid sequence of pAP322 as shown in Figure 17A; 

20 the nucleic acid sequence of pAP323 as shown in Figiu-e ISA; the nucleic 
acid sequence of pAP324 as shown in Figure 19 A; and the nucleic acid 
sequence of pAF325 as shown in Figure 20A. 

26. A linker protein having an amino add sequence selected 
from the group consisting of: the amino acid sequence of PAP301 as shown 

25 in Figure IC; the amino acid sequence of PAP302 as shown in Figure 2C; 
the amino acid sequence of PAP303 as shown in Figure 3C; the amino acid 
sequence of PAP304 as shown in Figure 4C; the amino acid sequence of 
FAF305 as shown in Figure 5C; the amino acid sequence of PAP308 as 
shown in Figure 6C; the am.ino acid sequence of PAP309 as shown in 

30 Figure 7C; the amino acid sequence of PAP313 as shovm in Figure 8C; the 
amino acid sequence of PAP314 as shown in Figure 9C; the amino acid 
sequence of PAP315 as shown in Figure IOC; the amino acid sequence of 
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PAP316 as shown in Figure IIC; the amino acid sequence of PAP317 as 
shown in Figure 12C; the amino acid sequence of PAP318 as shown in 
Figure 13C; the amino acid sequence of PAP319 as shown in Figure 14C; 
the amino acid sequence of PAP320 as shown in Figure 15C; the amino acid 
5 sequence of PAP321 as shown in Figure 16C; the amino acid sequence of 
PAP322 as shown in Figure 17C; the amino acid sequence of PAP323 as 
shown in Figure 18C; the amino acid sequence of PAP324 as shown in 
Figure 19C; and the amino acid sequence of PAP325 as shown in Figure 
20C. 
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ABSTRACT OF THE DISCLOSURE 

The present invention provides a protein having chain of a 
ricin-like toxin, a B chain of a ricin-like toxin and a novel heterologous 
linker amino acid sequence, linking the A and B chains. The linker 
sequence contains a cleavage recognition site for a cancer specific protease. 
The invention also relates to a nucleic acid molecule encoding the protein 
and to expression vectors incorporating the nucleic acid molecule. Also 
provided is a method of inhibiting or destro)dng cancer cells utilizing the 
nucleic acid molecules and proteins of the invention and pharmaceutical 
compositions for treating human cancer. 
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FIGURE IB CPl^ 



Sequence of pAP301 insert 

10 20 30 40 50 

I i I I I 

1 GAATTCATGAAACCGGGAGGAAATACTATTGTAATATGGATGTATGCAGT 
CTTAAGTACTTTGGCCCTCCTTTATGATAACATTATACCTACATACGTCA 

5 1 GGCAACATGGCTTTGTTTTGGATCCACCTCAGGGTGGTCTTTCACATTAG 
CCGTTGTACCGAAACAAAACCTAGGTGGAGTCCCACCAGAAAGTGTAATC 

101 AGGATAACAACATATTCCCCAAACAATACCCAATTATAAACTTTACCACA 
TCCTATTGTTGTATAAGGGGTTTGTTATGGGTTAATATTTGAAATGGTGT 

151 GCGGGTGCCACTGTGCAAAGCTACACAAACTTTATCAGAGCTGTTCGCGG 
CGCCCACGGTGACACGTTTCGATGTGTTTGAAATAGTCTCGACAAGCGCC 

201 TCGTTTAACAACTGGAGCTGATGTGAGACATGAAATACCAGTGTTGCCAA 
AGCAAATTGTTGACCTCGACTACACTCT^TACTTTATGGTCACAACGGTT 

251 ACAGAGTTGGTTTGCCTATAAACCAACGGTTTATTTTAGTTGAACTCTCA 
TGTCTCAACCAAACGGATATTTGGTTGCCAAATAAAATCAACTTGAGAGT 

301 AATCATGCAGAGCTTTCTGTTACATTAGCGCTGGATGTCACCAATGCATA 
TTAGTACGTCTCGAAAGACAATGTAATCGCGACCTACAGTGGTTACGTAT 

351 TGTGGTCGGCTACCGTGCTGGAAATAGCGCATATTTCTTTCATCCTGACA 
ACACCAGCCGATGGCACGACCTTTATCGCGTAT/^GAAAGTAGGACTGT 

401 ATCAGGAAGATGCAGAAGCAATCACTCATCTTTTCACTGATGTTCAAAAT 
TAGTCCTTCTACGTCTTCGTTAGTGAGTAGAAAAGTGACTACAAGTTTTA 

451 CGATATACATTCGCCTTTGGTGGTAATTATGATAGACTTGAACAACTTGC 
GCTATATGTAAGCGGAAACCACCATTAATACTATCTGAACTTGTTGAACXS 

501 TGGTAATCTGAGAGAAAATATCGAGTTGGGAAATGGTCCACTAGAGGAGG 
ACCATTAGACTCTCTTTTATAGCTCAACCCTTTACCAGGTGATCTCCTCC 

551 CTATCTCAGCGCTTTATTATTACAGTACTGGTGGCACTCAGCTTCCAACT 
GATAGAGTCGCGAAATAATAATGTCATGACCACCGTGAGTCGAAGGTTGA 

601 CTGGCTCGTTCCTTTATAATTTGCATCCAAATGATTTCAGAAGCAGCAAG 
GACCGAGCAAGGAAATATTAAACGTAGGTTTACTAAAGTCTTCGTCGTTC 

651 ATTCCAATATATTGAGGGAGAAATGCGCACGAGAATTAGGTACAACCGGA 
TAAGGTTATATAACTCCCTCTTTACGCGTGCTCTTAATCCATGTTGGCCT 
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701 GATCTGCACCAGATCCTAGCGTAATTACACTTGAGAATAGTTGGGGGAGA 
CTAGACGTGGTCTAGGATCGCATTAATGTGAACTCTTATCAACCCCCTCT 

751 CTTTCCACTGCAATTCAAGAGTCTAACCAAGGAGCCTTTGCTAGTCCAAT 
GAAAGGTGACGTTAAGTTCTCAGATTGGTTCCTCGGAAACGATCAGGTTA 

801 TCAACTGCAAAGACGTAATGGTTCCAAATTCAGTGTGTACGATGTGAGTA 
AGTTGACGTTTCTGCATTACCAAGGTTTAAGTCACACATGCTACACTCAT 

851 TATTAATCCCTATCATAGCTCTCATGGTGTATAGATGCGCACCTCCACCA 
ATAATTAGGGATAGTATCGAGAGTACCACATATCTACGCGTGGAGGTGGT 

901 TCGTCTICAGTTTGGTCCTCTTGGCATGTGGGGACAACGAAATTTTAATGC 
AGCAGTGTCAAACCAGGAGAACCGTACACCCCTGTTGCTTTAAAATTACG 

951 TGATGTTTGTATGGATCCTGAGCCCATAGTGCGTATCGTAGGTCGAAATG 
ACTACAAACATACCTAGGACTCGGGTATCACGCATAGCATCCAGCTTTAC 

1001 GTCTATGTGTTGATGTTAGGGATGGAAGATTCCACAACGGAAACGCAATA 
CAGATAGACAAOTAOATCCCTACCTTCTAAGGTGTTGCCTTTGCGTTAT 

1051 CAGTTGTGGCCATGCTAGTCTAATACAGATGCATATCAGCTCTGGAC^ 

GTCAACACCGGTACGTTCAGATTATGTCTACGTTTAGTCGAGACCTGAAA 

1101 GAAAAGAGACAATACTATTCGATCTAATGGAAAGTGTTTAACTACTTACG 
CTTTTCTCTGTTATGATAAGCTAGATTACCTTTCACAAATTGATGAATGC 

1151 GGTACAGTCCGGGAGTCTATGTGATGATCTATGATTGCAATACTGCTGCA 
CCATGTCAGGCCCTCAGATACACTACTAGATACTAACGTTATGACGACGT 

1201 ACTGATGCCACCCGCTGGCAAATATGGGATAATGGAACCATCATAAATCC 
TGACTACGGTGGGCGACCGTTTATACCCTATTACCTTGGTAGTATTTAGG 

1251 CAGATCTAGTCTAGTTTTAGCAGCGACATCAGGGAACAGTGGTACCACAC 
GTCTAGATCAGATCAAAATCGTCGCTGTAGTCCCTTGTCACCATGGTGTG 

1301 TTACAGTGCAAACCAACATTTATGCCGTTAGTCAAGGTTGGCTTCCTACT 
AATGTCACGTTTGGTTGTAAATACGGCAATCAGTTCCAACCGAAGGATGA 

1351 AATAATACACAACCTTTTGTTACAACCATTGTTGGGCTATATGGTCTGTG 
TTATTATGTGTTGGAAAACAATGTTGGTAACAACCCGATATACCAGACAC 

1401 CTTGCAAGCAAATAGTGGACAAGTATGGATAGAGGACTGTAGCAGTGAAA 
GAACGTTCGTTTATCACCTGTTCATACCTATCTCCTGACATCGTCACTTT 
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1451 AGGCTGAACAACAGTGGGCTCTTTATGCAGATGGTTCAATACGTCCTCAG 
TCCGACTTGTTGTCACCCGAGAAATACGTCTACCAAGTTATGCAGGAGTC 

1501 CAAAACCGAGATAATTGCCTTACAAGTGATTCTAATATACGGGAAACAGT 
GTTTTGGCTCTATTAACGGAATGTTCACTAAGATTATATGCCCTTTGTCA 

1551 TGTTAAGATCCTCTCTTGTGGCCCTGCATCCTCTGGCCAACGATGGATGT 
ACAATTCTAGGAGAGAACACCGGGACXSTAGGAGACCGGITOCTACCTACA 

1601 TCAAGAATGATGGAACCATTTTAAATTTGTATAGTGGGTTGGTGTTAGAT 
AGTTCTTACTACCTTGGTAAAATTTAAACy^TATCACCCAACCACAATCT^ 

1651 GTGAGGCGATCGGATCCGAGCCTTAAACAAATCATTCTTTACCCTCTCCA 
CACTCCGCTAGCCTAGGCTCGGAATTTGTTTAGTAAGAAATGGGAGAGGT 

1701 TGGTGACCCAAACCT^AATATGGTTACCATTATTTTGATAGACAGATTACT 
ACCACTGGGTTTGGTTTATACCAATGGTAATAAAACTATCTGTCTAATGA 

1751 CTCTTGCAGTGTGTGTGTCCTGCCATGAAAATAGATGGCTTAAATAAAAA 
GAGAACGTCACACACACAGGACGGTACTTTTATCTACCGAATTTATTTTT 

1801 GGACATTGTAAATTTTGTAACTGAAAGGACAGCAAGTTATATCGAATTCC 
CCTGTAACATTTAAAACATTGACTTTCCTGTCGTTCAATATAGCTTAAGG 

1851 TGCAG 
ACGTC 



Total number of bases is: 1855. 
Sequence name: pAP301 

Note: Nucleotides in bold are found within the mutant preproricin linker region. The symbol 
wiUiin the linker region designate deleted nucleotides. 
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FIGURE 2B (PI) 



Sequence of pAP302 insert 

10 20 30 40 50 

I I I I I 

1 GAATTCATGAAACCGGGAGGAAATACTATTGTAATATGGATGTATGCAGT 

CTTAAGTACTTTGGCCCTCCTTTATGATAACATTATACCTACATACGTCA 

5 1 GGCAACATGGCTTTGTTTTGGATCCACCTCAGGGTGGTCTTTCACATTAG 
CCGTTGTACCGAAACAAAACCTAGGTGGAGTCCCACCAGAAAGTGTAATC 

101 AGGATAACAACATATTCCCCTU^CAATACCCAATTATAAACTTTACCACA 
TCCTATTGTTGTATAAGGGGTTTGTTATGGGTTAATATTTGAAATGGTGT 

151 GCGGGTGCCACTGTGCAAAGCTACACAAACTTTATCAGAGCTGTTCGCGG 
CGCCCACGGTGACACGTTTCGATGTGTTTGAAATAGTCTCGACAAGCGCC 

201 TCGTTTAACAACTGGAGCTGATGTGAGACATGAAATACCAGTGTTGCC AA 
AGCAAATTGTTGACCTCGACTACACTCTGTACTTTATGGTCACAACGGTT 

251 ACAGAGTTGGTTTGCCTATAAACCAACGGTTTATTTTAGTTGAACTCTCA 
TGTCTCAACCAAACGGATATTTGGTTGCCAAATAAAATCAACTTGAGAGT 

3 01 T^TCATGCAGAGCTTTCTGTTACATTAGCGCTGGATGTCACCAATGCATA 
TTAGTACGTCTCGAAAGACAATGTAATCGCGACCTACAGTGGTTACGTAT 

351 TGTGGTCGGCTACCGTGCTGGAAATAGCGCATATTTCTTTCATCCTGACA 
ACACCAGCCGATGGCACGACCTTTATCGCGTATAAAGAAAGTAGGACTGT 

401 ATO^GGAAGATGCAGAAGCAATCACTGATCTTTTCACTGATGTTCAAAAT 
TAGTCCTTCTACGTCTTCGTTAGTGAGTAGAAAAGTGACTACAAGTTTTA 

451 CGATATACATTCGCCTTTGGTGGTAATTATGATAGACTTGAACAACTTGC 
GCTATATGTAAGCGGAAACCACCATTAATACTATCTGAACTTGTTGAACG 

501 TGGTAATCTGAGAGAAAATATCGAGTTGGGAAATGGTCCACTAGAGGAGG 
ACCATTAGACTCTCTTTTATAGCTCAACCCTTTACCAGGTGATCTCCTCC 

551 CTATCTCAGCGCTTTATTATTACAGTACTGGTGGCACTCAGCTTCCAACT 
GATAGAGTCGCGAAATAATAATGTCATGACCACCGTGAGTCGAAGGTTGA 

601 CTGGCTCGTTCCTTTATAATTTGCATCCAAATGATTTCAGAAGCAGCAAG 
GACCGAGCAAGGAAATATTAAACGTAGGTTTACTAAAGTCTTCGTCGTTC 

651 ATTCCAATATATTGAGGGAGAAATGCGCACGAGAATTAGGTACAACCGGA 
TAAGGTTATATAACTCCCTCTTTACGCGTGCTCTTAATCCATGTTGGCCT 
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701 GATCTGCACCAGATCCTAGCGTAATTACACTTGAGAATAGTTGGGGGAGA 
CTAGACGTGGTCTAGGATCGCATTAATGTGAACTCTTATCAACCCCCTCT 

751 CTTTCCACTGCAATTCAAGAGTCTAACCAAGGAGCCTTTGCTAGTCCAAT 
GAAAGGTGACGTTAAGTTCTCAGATTGGTTCCTCGGAAACGATCAGGTTA 

801 TCAACTGCAAAGACGTAATGGTTCCAAATTCAGTGTGTACGATGTGAGTA 
AGTTGACGTTTCTGCATTACCAAGGTTTAAGTCACACATGCTACACTCAT 

851 TATTAATCCCTATCATAGCTCTCATGGTGTATAGATGCGCACCTCCACCA 
ATAATTAGGGATAGTATCGAGAGTACCACATATCTACGCGTGGAGGTGGT 

901 TCGTCACAGTTTTCTCCGCAAGGAATTGCAGGGCAG 

AGCAGTGTCAAAAGAG6CGTTCCTTAACGTCCCGTC 

951 TGTATGGATCCTGAGCCCATAGTGCGTATCGTAGGTCGAAATG 

ACATACCTAGGACTCGGGTATCACGCATAGCATCCAGCTTTAC 

1001 GTCTATGTGTTGATGTTAGGGATGGAAGATXeCACAACGGAAACGCAATA 
CAGATACACAACTACAATCCCTACCTTCTAAGGTGTTGCCTTTGCGTTAT 

1051 CAGTTGTGGCCATGCAAGTCTAATACAGATGCAAATCy^GCTCTGGACTTT 
GTCAACACCGGTACGTTCAGATTATGTCTACGTTTAGTCGAGACCTGAAA 

1101 GAAAAGAGACAATACTATTCGATCTAATGGAAAGTGTTTAACTACTTACG 
CTTTTCTCTGTTATGATAAGCTAGATTACCTTTCACAAATTGATGAATGC 

1151 GGTACAGTCCGGGAGTCTATGTGATGATCTATGATTGGAATACTGCTGCA 
CCATGTCAGGCCCTCAGATACACTACTAGATACTAACGTTATGACGACGT 

1201 ACTGATGCCACCCGCTGGCAAATATGGGATAATGGAACCATCATAAATCC 
TGACTACGGTGGGCGACCGTTTATACCCTATTACCTTGGTAGTATTTAGG 

1251 CAGATCTAGTCTAGTTTTAGCAGCGACATCAGGGAACAGTGGTACCAC^ 
GTCTAGATCAGATCAAAATCGTCGCTGTAGTCCCTTGTCACCATGGTGTG 

13 01 TTACAGTGCA/ACCAACATTTATGCCGTTAGTCAAGGTTGGCTTCCTACT 
AATGTCACGTTTGGTTGTAAATACGGCAATCAGTTCCAACCGAAGGATGA 

13 51 AATAATACACAACCTTTTGTTACAACCATTGTTGGGCTATATGGTCTGTG 

TTATTATGTGTTGGAAAACAATGTTGGTAACAACCCGATATACCAGACAC 

14 01 CTTGCAAGCAAATAGTGGACAAGTATGGATAGAGGACTGTAGCAGTGAAA 

GT^CGTTCGTTTATCACCTGTTaiTAGCTATCTCCTtSAG^ 



FIGURE 2B (P3) 

1451 AGGCTGAACAACAGTGGGCTCTTTATGCAGATGGTTCAATACGTCCTCAG 
TCCGACTTGTTGTCACCCGAGAAATACGTCTACCAAGTTATGCAGGAGTC 

1501 CAAAACCGAGATAATTGCCTTACAAGTGATTCTAATATACGGGAAACAGT 
GTTTTGGCTCTATTAACGGAATGTTCACTAAGATTATATGCCCTTTGTCA 

1551 TGTTAAGATCCTCTCTTGTGGCCCTGCATCCTCTGGCCAACGATGGATGT 
ACAATTCTAGGAGAGAACACCGGGACGTAGGAGACCGGTTGCTACCTACA 

1601 TCAAGAATGATGGAACCATTTTAAATTTGTATAGTGGGTTGGTGTTAGAT 
AGTTCTTACTACCTTGGTAAAATTTAAACATATCACCCAACCACAATCTA 

1651 GTGAGGCGATCGGATCCGAGCCTTAAACAAATCATTCTTTACCCTCTCCA 
CACTCCGCTAGCCTAGGCTCGGAATTTGTTTAGTAAGAAATGGGAGAGGT 

1701 TGGTGACCCAAACCAAATATGGTTACCATTATTTTGATAGACAGATTACT 
ACCACTGGGTTTGGTTTATACCAATGGTAATAAAACTATCTGTCTAATGA 

1751 CTCTTGCAGTGTGTGTGTCCTGCCATGAAAATAGATGGCTTAAATAAAAA 
" GAGAACGTCACACACACAGGACGGTACTTTTATCTACCGAATTTATTTTT 

1801 GGACATTGTAAATTTTGTAACTGAAAGGACAGCAAGTTATATCGAATTCC 
•'II CCTGTAACATTTAAAACATTGACTTTCCTGTCGTTCAATATAGCTTAAGG 

□ 1851 TGCAG 

ACGTC 



Total niunber of bases is: 1834. 
Sequence name: pAP302 

Note: Nucleotides in bold are found mtfain the mutant preproricin linker region. The symbol 
within the linker region designate deleted nucleotides. 
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FIGURE 3B rPl^ 



Sequence of pAP303 insert 

10 20 30 40 50 

I I I I I 

1 GAATTCATGAAACCGGGAGGAAATACTATTGTAATATGGATGTATGCAGT 
CTTAAGTACTTTGGCCCTCCTTTATGATAACATTATACCTACATACGTCA 

5 1 GGCAACATGGCTTTGTTTTGGATCCACCTCAGGGTGGTCTTTCACATTAG 
CCGTTGTACCGAAACAAAACCTAGGTGGAGTCCCACCAGAAAGTGTAATC 

101 AGGATAACAACATATTCCCCAAACAATACCCAATTATAAACTTTACCACA 
TCCTATTGTTGTATAAGGGGTTTGTTATGGGTTAATATTTGAAATGGTGT 

151 GCGGGTGCCACTGTGCAAAGCTACACAAACTTTATCAGAGCTGTTCGCGG 
CGCCCACGGTGACACGTTTCGATGTGTTTGAAATAGTCTCGACAAGCGCC 

201 TCGTTTAACAACTGGAGCTGATGTGAGACATGAAATACCAGTGTTGCCAA 
AGCAAATTGTTGACCTCGACTACACrcrGTACTTtATGGTa^CAA 

251 ACAGAGTTGGTTTGCCTATAAACCAACGGTTTATTTTAGTTGAACTCTCA 
TGTCTCAACCAAACGGATATTTGGTTGCCyVAATAAAATCAACTTGAGAGT 

301 AATCATGCAGAGCTTTCTGTTACATTAGCGCTGGATGTCACCAATGCATA 
TTAGTACGTCTCGAAAGACAATGTAATCGCGACCTACAGTGGTTACGTAT 

351 TGTGGTCGGCTACCGTGCTGGAAATAGCGCATATTTCTTTCATCCTGACA 
ACACCAGCCGATGGCACGACCTTTATCGCGTATAAAGAAAGTAGGACTGT 

401 ATCAGGAAGATGCAGAAGCAATCACTCATCirrTCACTGATC 

TAGTCCTTCTACGTCTTCGTTAGTGAGTAGAAAAGTGACTACAAGTTTTA 

451 CGATATACATTCGCCTTTGGTGGTAATTATGATAGACTTGAACAACTTGC 
GCTATATGTAAGCGGAAACCACCATTAATACTATCTGAACTTGTTGAACG 

501 TGGTAATCTGAGAGAAAATATCGAGTTGGGAAATGGTCCACTAGAGGAGG 
ACCATTAGACrCTCTTTTATAGCTCAACCCTTTACCAGGTGATCT 

551 CTATCTCAGCGCTTTATTATTACAGTACTGGTGGCACTCAGCTTCC^ 

GATAGAGTCGCGAAATAATAATGTCATGACCACCGTGAGTCGAAGGTTGA 

601 CTGGCTCGTTCCTTTATAATTTGCATCCAAATGATTTCAGAAGCAGCAAG 
GACCGAGCAAGGAAATATTATUVCGTAGGTTTACTAAAGTCTTCGTCGTTC 

651 ATTCCAATATATTGAGGGAGAAATGCGCACGAGAATTAGGTACAACCGGA 
TAAGGTTATATAACTCCCTCTTTACGCGTGCTCTTAATCCATGTTGGCCT 



FIGURE 3B (V2) 

701 GATCTQCACCAGATCCTAGCGTAATTACACTTGAGAATAGTTGGGGGAGA 
CTAGACGTGGTCTAGGATCGCATTAATGTGAACTCTTATCAACCCCCTCT 

751 CTTTCCACTGCAATTCAAGAGTCTAACCAAGGAGCCTTTGCTAGTCCAAT 
GAAAGGTGACGTTAAGTTCTCAGATTGGTTCCTCGGAAACX3ATCAGGTTA 

801 TCAACTGCAAAGACGTAATGGTTCCAAATTCAGTGTGTACGATGTGAGTA 
AGTTGACGTTTCTGCATTACCAAGGTTTAAGTCACACATGCTACACTCAT 

851 TATTAATCCCTATCATAGCTCTCATGGTGTATAGATGC --- 

ATAATTAGGGATAGTATCGAQAGTACCACATATCTACG 

901 TCTCCGCAAGGAATTGCAGGGCAOCGAAATTTTAATGC 

AGAGGCOTTCCTTAACGTCCCGTCGCTTTAAAATTACO 

951 TGATOTTT6TATGGATCCTGAGCCCATAGTGCGTATCGTAGGTCGAAATG 
ACTACAAACATACCTAGGACTOSGGTATCACGCATAGCa^TCa^GCTTTAC 

1001 GTCTATGTGTTGATGTTAGGGATGGAAGATTCCACAACGGAAACGCAATA 
CAGATACACAACTAOUITCCCTACCTTCTAAGGTGTTGCCTTTGCGTTAT 

1051 CAGTTGTGGCCATGCAAGTCTAATACAQATGCAAATCAGCTCTGGACTTT 
GTCAACACCGGTACGTTCAGATTATGTCTACGTTTAGTCGAGACCTGAAA 

1101 GAAAAGAGACy^TACTATTCGATCTAATGGAAAGTGTTTAACTACTTACXS 
CTTTTCTCTGTTATGATAAGCTAGATTACCTTTCACAAATTGATGAATGC 

1151 GGTACAGTCCGGGAGTCTATGTGATGATCTATGATTGCAATACTGCTGCA 
CCATGTCAGGCCCTCAGATACACTACTAGATACTAACGTTATGACGACGT 

1201 ACTGATGCCACCCGCTGGCAAATATGGGATAATGGAACCATCATAAATCC 
TGACTACGGTGGGCGACCGTTTATACCCTATTACCTTGGTAGTATTTAGG 

1251 CAGATCTAGTCTAGTTTTAGCAGCGACATCAGGGAACAGTGGTACCACAC 
GTCTAGATCAGATCAAAATCGTCGCTGTAGTCCCTTGTCACCATGGTGTG 

1301 TTACAGTGCAAACCAACATTTATGCCGTTAGTCAAGGTTGGCTTCCTACT 
AATGTCACGTTTGGTTGTAAATACGGCAATCAGTTCCAACCGAAGGATGA 

1351 AATAATACACAACCTTTTGTTACAACCATTGTTGGGCTATATGGTCTGTG 
TTATTATGTGTTGGAAAACAATGTTGGTAACAACCCGATATACCAGACAC 

1401 CTTGCAAGCAAATAGTGGACAAGTATGGATAGAGGACTGTAGCAGTGAAA 
GAACGTTCX3TTTATCACCTGTTCATACCTATCTCCTGACATCGTCACTTT 



FIGURE 3B (P3) 

1451 AGGCTGAACAACAGTGGGCTCTTTATGCAGATGGTTCAATACGTCCTCAG 
TCCGACTTGTTGTCACCCGAGATIATACGTCTACCAAGTTATGCAGGAGTC 

1501 CAAAACCGAGATAATTGCCTTACAAGTGATTCTAATATACGGGAAACAGT 
GTTTTGGCTCTATTAACGGAATGTTCACTAAGATTATATGCCCTTTGTCA 

1551 TGTTAAGATCCTCTCTTGTGGCCCTGCATCCTCTGGCCAACGATGGATGT 
ACAATTCTAGGAGAGAACACCGGGACGTAGGAGACCGGTTGCTACCTACA 

1601 TCAAGAATGATGGAACCATTTTAAATTTGTATAGTGGGTTGGTGTTAGAT 
AGTTCTTACTACCTTGGTAAAATTTAAACATATCACCCAACCACAATCTA 

1651 GTGAGGCGATCGGATCCGAGCCTTAAACAAATCATTCTTTACCCTCTCCA 
CACTCCGCTAGCCTAGGCTCGGAATTTGTTTAGTAAGAAATGGGAGAGGT 

1701 TGGTGACCCAAACCAAATATGGTTACCATTATTTTGATAGACAGATTACT 
ACCACTGGGTTTGGTTTATACCAATGGTAATAAAACTATCTGTCTAATGA 

1751 CTCTTGCAGTGTGTGTGTCCTGCCATGAAAATAGATGGCTTAAATAAAAA 
GAGAACGTCACACACACAGGACGGTACTTTTATCTACCGAATTTATTTTT 

1801 GGACATTGTAAATTTTGTAACTGAAAGGACAGCAAGTTATATCGAATTCC 
CCTGTAACATTTAAAACATTGACTTTCCTGTCGTTCAATATAGCTTAAGG 

1851 TGCAG 
ACGTC 



Total number of bases is: 1831. 
Sequence name: pAP303 

Note: Nucleotides in bold are found within the mutant preproricin linker region. The symbol 
within the linker region designate deleted nucleotides. 
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FIGURE 4B fPl) 



Sequence of pAP304 insert 

10 20 30 40 50 

I I I I I 

1 GAATTCATGAAACCGGGAGGAAATACTATTGTAATATGGATGTATGCAGT 

CTTT^GTACTTTGGCCCTCCTTTATGATAACATTATACCTACATACGTCA 

5 1 GGCAACATGGCTTTGTTTTGGATCCACCTCAGGGTGGTCTTTCACATTAG 
CCGTTGTACCGAAACAAAACCTAGGTGGAGTCCCACCAGAAAGTGTAATC 

101 AGGATAACAACATATTCCCCAAACAATACCCAATTATAAACTTTACCACA 
TCCTATTGTTGTATAAGGGGTTTGTTATGGGTTAATATTTGAAATGGTGT 

151 GCGGGTGCCACTGTGCAAAGCTACACAAACTTTATCAGAGCTGTTCGCGG 
CGCCCACGGTGACACGTTTCGATGTGTTTGAAATAGTCTCGACAAGCGCC 

201 TCGTTTAACAACTGGAGCTGATGTGAGACATGAAATACCAGTGTTGCCAA 
AGCAAATTGTTGACCTCGACTACACTCTGTACTTTATGGTCACAACGGTT 

251 ACAGAGTTGGTTTGCCTATAAACCAACGGTTTATTTTAGTTGAACTCTCA 
TGTCTCAACCAAACGGATATTTGGTTGCCAAATAAAATCAACTTGAGAGT 

301 AATCATGCAGAGCTTTCTGTTACATTAGCGCTGGATGTCACCAATGCATA 
TTAGTACGtCTCGAAAGACAATGTAATCGCGACCTACAGTGGTTACGTAT 

351 TGTGGTCGGCTACCGTGCTGGAAATAGCGCATATTTCTTTCATCCTGACA 
ACACCAGCCGATGGCACGACCTTTATCGCGTATAAAGAAAGTAGGACTGT 

401 ATCAGGAAGATGCAGAAGCAATCACTCATCTTTTCACTGATGTTCAAAAT 
TAGTCCTTCTACGTCTTCGTTAGTGAGTAGAAAAGTGACTACAAGTTTTA 

451 CGATATACATTCGCCTTTGGTGGTAATTATGATAGACTTGAACAACTTGC 
GCTATATGTAAGCGGAAACCACCATTAATACTATCTGAACTTGTTGAACG 

501 TGGTAATCTGAGAGAAAATATCGAGTTGGGAAATGGTCCACT AGAGGAGG 
ACCATTAGACTCTCTTTTATAGCTCAACCCTTTACCAGGTGATCTCCTCC 

551 CTATCTCAGCGCTTTATTATTACAGTACTGGTGGCACTCAGCTTCCAACT 
GATAGAGTCGCGAAATAATAATGTCATGACCACCGTGAGTCGAAGGTTGA 

601 CTGGCTCGTTCCTTTATAATTTGCATCCAAATGATTTCAGAAGCAGCAAG 
GACCGAGCAAGGAAATATTAAACGTAGGTTTACTAAAGTCTTCGTCGTTC 

651 ATTCCAATATATTGAGGGAGAAATGCGCACGAGAATTAGGTACAACCGGA 
TAAGGTTATATAACTCCCTCTTTACGCGTGCTCTTAATCCATGTTGGCCT 



FIGURE 4B (¥2) 



701 GATCTGCACCAGATCCTAGCGTAATTACACTTGAGAATAGTTGGGGGAGA 
CTAGACGTGGTCTAGGATCGCATTAATGTGAACTCTTATCAACCCCCTCT 

751 CTTTCCACTGO^TTCT^GAGTCTAACCAAGGAGCCTTTGCTAGTCC^ 
GAAAGGTGACGTTAAGTTCTCAGATTGGTTCCTCGGAAACGATCAGGTTA 

801 TCAACTGCAAAGACGTAATGGTTCCAAATTCAGTGTGTACGATGTGAGTA 
AGTTGACGTTTCTGCATTACCAAGGTTTAAGTCACACATGCTACACTC^ 

851 TATTAATCCCTATCATAGCTCTCATGGTGTATAGAT6C 

ATAATTAGGGATAGTATCGAGAGTACCACATATCTACG 

901 TCTCCGCAAGGAATTGCAGGGCAG 

AGAGGCGTTCCTTAACGTCCCOTC 

951 TGTATGGATCCTGAGCCCATAGTGCGTATCGTAGGTCGAAATG 

ACATACCTAGGACTCGGGTATCACGCATAGCATCCAGCTTTAC 

1001 GTCTATGTGTTGATGTTAGGGATGGAAGATTCCACAACGGAAACGCAATA 
CAGATACACAACTACAATCCCTACCTTCTAAGGTGTTGCCTTTGCGTTAT 

1051 CAGTTGTGGCCATGCAAGTCTAATACAGATGCAAATCAGCTCTGGACTTT 
GTCAACACCGGTACGTTCAGATTATGTCTACGTTTAGTCGAGACCTGAAA 

1101 GAAAAGAGACAATACTATTCGATCTAATGGAAAGTGTTTAACTACTTACG 
CTTTTCTCTGTTATGATAAGCTAGATTACCTTTCACAAATTGATGAATGC 

1151 GGTACAGTCCGGGAGTCTATGTGATGATCTATGATTGCAATACTGCTGCA 
CCATGTCAGGCCCTCAGATACACTACTAGATACTAACGTTATGACGACGT 

1201 ACTGATGCCACCCGCTGGCAAATATGGGATAATGGAACCATCATAAATCC 
TGACTACGGTGGGCGACCGTTTATACCCTATTACCTTGGTAGTATTTAGG 

1251 CAGATCTAGTCTAGTTTTAGCAGCGACATCAGGGAACAGTGGTACCACAC 
GTCTAGATCAGATCAAAATCGTCGCTGTAGTCCCTTGTCACCATGGTGTG 

1301 TTACAGTGCAAACCAACATTTATGCCGTTAGTCAAGGTTGGCTTCCTACT 
AATGTCACGTTTGGTTGTAAATACGGCAATCAGTTCCAACCGAAGGATGA 

1351 AATAATACACAACCTTTTGTTACAACCATTGTTGGGCTATATGGTCTGTG 
TTATTATGTGTTGGAAAACAATGTTGGTAACAACCCGATATACCAGACAC 

1401 CTTGCAAGCAAA^TAGTGGACAAGTATGGATAGAGGAGTGTAGGAGTGAAA 
GAACGTTCGTTTATCACCroTTCATAteCTATCTCCrro 



FIGURE 4B (P3> 



1451 AGGCTGAACAACAGTGGGCTCTTTATGCAGATGGTTCAATACGTCCTCAG 
TCCGACTTGTTGTCACCCGAGAAATACGTCTACCAAGTTATGCAGGAGTC 

1501 CAAAACCGAGATAATTGCCTTACAAGTGATTCTAATATACGGGAAACAGT 
GTTTTGGCTCTATTAACGGAATGTTCACTAAGATTATATGCCCTTTGTCA 

1551 TGTTAAGATCCTCTCTTGTGGCCCTGCATCCTCTGGCCAACGATGGATGT 
ACAATTCTAGGAGAGAACACCGGGACGTAGGAGACCGGTTGCTACCTACA 

1601 TCAAGAATGATGGAACCATTTTAAATTTGTATAGTGGGTTGGTGTTAGAT 
AGTTCTTACTACCTTGGTAAAATTTAAACATATCACCCAACCACAATCTA 

1651 GTGAGGCGATCGGATCCGAGCCTTAAACAAATCATTCTTTACCCTCTCCA 
CACTCCGCTAGCCTAGGCTCGGAATTTGTTTAGTAAGAAATGGGAGAGGT 

1701 TGGTGACCCAAACCAAATATGGTTACCATTATTTTGATAGACAGATTACT 
ACCACTGGGTTTGGTTTATACCAATGGTAATAAAACTATCTGTCTAATGA 

1751 CTCTTGCAGTGTGTGTGTCCTGCCATGAAAATAGATGGCTTAAATAAAAA 
GAGAACGTCACACACACAGGACGGTACTTTTATCTACCGAATTTATTTTT 

1801 GGACATTGTAAATTTTGTAACTGAAAGGACAGCAAGTTATATCGAATTCC 
CCTGTAACATTTAAAACATTGACTTTCCTGTCGTTCAATATAGCTTAAGG 

1851 TGCAG 
ACGTC 



Total number of bases is: 1810. 
Sequence name: pAP304 

Note: Nucleotides in bold are found within the mutant preproricin linker region. The symbol 
within the linker region designate deleted nucleotides. 
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FIGURE 5B (PI) 



Sequence of pAP305 insert 

10 20 30 40 50 

I I I I I 

1 GAATTCATGAAACCGGGAGGAAATACTATTGTAATATGGATGTATGCAGT 

CITAAGTACTTTGGCCCTCCTTTATGATAACATTATACCrrACATACGT(^ 

51 GGCT^CATGGCTTTGTTTTGGATCCACCrrCAGGGTGGTCTTTCTVCA 

CCGTTGTACCGAAACAAAACCTAGGTGGAGTCCCACCAGAAAGTGTAATC 

101 AGGATAACAACATATTCCCGAAAO^TACCa^TTATAAACTTTAC^ 

TCCTATTGTTGTATAAGGGGTTTGTTATGGGTTAATATTTGAAATGGTGT 

151 GCGGGTGCCACTGTGCAAAGCTACACAAACTTTATCAGAGCTGTTCGCGG 
CGCCCACGGTGACACGTTTCGATGTGTTTGAAATAGTCTCGACAAGCGCC 

201 TCGTTTAACAACTGGAGCTGATGTGAGACATGAAATACCAGTGTTGCCAA 
AGCAAATTGTTGACCTCGACTACACTCTGTACTTTATGGTCACTVACGG 

251 ACAGAGTTGGTTTGCCTATAAACCAACGGTTTATTTTAGTTGT^CTCTCA 
TGTCTCAACCAAACGGATATTTGGTTGCCAAATAAAATCAACTTGAGAGT 

3 01 AATCATGCAGAGCTTTCTGTTACATTAGCGCTGGATGTCACCAATGCATA 
TTAGTACGTCTCGAAAGACAATGTAATCGCGACCTACAGTGGTTACGTAT 

351 TGTGGTCGGCTACCGTGCTGGAAATAGCGCATATTTCTTTCATCCTGACA 
ACACCAGCCGATGGCACGACCTTTATCGCGTATAAAGAAAGTAGGACTGT 

401 ATCAGGAAGATGCAGAAGCAATCACTCATCTTTTCACTGATGTTCAAAAT 
TAGTCCTTCTACGTCTTCGTTAGTGAGTAGAAAAGTGACTACAAGTTTTA 

451 CGATATACATTCGCCTTTGGTGGTAATTATGATAGACITGAACAACTTGC 
GCTATATGTAAGCGGAAACCACCATTAATACTATCTQAACTTGTTGAACG 

501 TGGTAATCTGAGAGAAAATATCGAGTTGGGAAATGGTCCACTAGAGGAGG 
ACCATTAGACTCTCTTTTATAGCTCAACCCTTTACCAGGTGATCTCCTCC 

551 CTATCTCAGCGCTTTATTATTACAGTACTGGTGGCACTCAGCTTCCAACT 
GATAGAGTCGCGAAATAATAATGTCATGACCACCGTGAGTCGAAGGTTGA 

601 CTGGCTCGTTCCTTTATAATTTGCATCCAAATGATTTCAGAAGCAGCAAG 
GACCGAGCAAGGAAATATTAAACGTAGGTTTACTAAAGTCTTCGTCGTTC 

651 ATTCCAATATATTGAGGGAGAAATGCGCACGAGAATTAGGTACAACCGGA 
TAAGGTTATATAACTCCCTCTTTACGCGTGCTCTTAATCCATGTTGGCCT 



FIGURE SB fP2) 



701 GATCTGCACCAGATCCTAGCGTAATTACACTTGAGAATAGTTGGGGGAGA 
CTAGACGTGGTCTAGGATCGCATTAATGTGAACTCTTATCAACCCCCTCT 

751 CTTTCCACTGCAATTCAAGAGTCTAACCAAGGAGCCTTTGCTAGTCCAAT 
GAAAGGTGACGTTAAGTTCTCAGATTGGTTCCTCGGAAACGATCAGGTTA 

801 TCAACTGCAAAGACGTAATGGTTCCAAATTCAGTGTGTACGATGTGAGTA 
AGTTGACGTTTCTGCATTACCAAGGTTTAAGTCACACATGCTACACTCAT 

851 TATTAATCCCTATCATAGCTCTCATGGTGTATAGATGCGCACCTCCACCA 
ATAATTAGGGATAGTATCGAGAGTACCACATATCTACGCGTGGAGGTGGT 

901 TCTCCGCAAGGAATTGCAGGGCAG 

AGAGGCGTTCCTTAACGTCCCGTC 

951 TGTATGGATCCTGAGCCCATAG.TGCGTATCGTAGGTCGAAATG 

ACATACCTAGGACTCGGGTATCACGCATAGCATCCAGCTTTAC 

1001 GTCTATGTGTTGATGTTAGGGATGGAAGATTCCACAACGGAAACGCAATA 
CAGATACJ^CAACTACAATCCCTACCTTCTAAGGTGTTGCCIT^ 

1051 CAGTTGTGGCCATGCAAGTCTAATACAGATGCAAATCAGCTCTGGACTTT 
GTCAACACCGGTACGTTCAGATTATGTCTACGTTTAGTCGAGACCTGAAA 

1101 GAAAAGAGACAATACTATTCGATCTAATGGAAAGTGTTTAACTACTTACG 
CTTTTCTCTGTTATGATAAGCTAGATTACCTTTCACAAATTGATGAATGC 

1151 GGTACAGTCCGGGAGTCTATGTGATGATCTATGATTGCAATACTGCTGCA 
CCATGTCAGGCCCTCAGATACACTACTAGATACTAACGTTATGACGACGT 

12 0 1 ACTGATGCCACCCGCTGGCAAATATGGGATAATGGAACCATCATAAATCC 

TGACTACGGTGGGCGACCGTTTATACCCTATTACCTTGGTAGTATTTAGG 

1251 CAGATCTAGTCTAGTTTTAGCAGCGACATCAGGGAACAGTGGTACCACAC 
GTCTAGATCAGATCAAAATCGTCGCTGTAGTCCCTTGTCACCATGGTGTG 

13 01 TTACAGTGCAAACCAACATTTATGCCGTTAGTCAAGGTTGGCTTCCTACT 

AATGTCACGTTTGGTTGTAAATACGGCAATCAGTTCCAACCGAAGGATGA 

1351 AATAATACACAACCTTTTGTTACAACCATTGTTGGGCTATATGGTCTGTG 
TTATTATGTGTTGGAAAACAATGTTGGTAACAACCCGATATACCAGACAC 

1401 CTTGCAAGCAAATAGTGGACAAGTATGGATAGAGGACTGTAGCAGTGAAA 
GAACGTTCGTTTATCACCTGTTCATACCTATCTCCTGACATCGTCACTTT 



FIGURE SB (P3) 



1451 AGGCTGAACAACAGTGGGCrCTTTATGCAGATGGTTCAATACGTCCTCAG 
TCCGACTTGTTGTCACCCGAGAAATACGTCTACCAAGTTATGCAGGAGTC 

1501 CAAAACCGAGATAATTGCCTTACAAGTGATTCTAATATACGGGAAACAGT 
GTTTTGGCTCTATTAACGGAATGTTCACTAAGATTATATGCCCTTTGTCA 

1551 TGTTAAGATCCTCTCTTGTGGCCCTGCATCCTCTGGCCAACGATGGATGT 
ACAATTCTAGGAGAGAACACCGGGACGTAGGAGACCGGTTGCTACCTACA 

1601 TCAAGAATGATGGAACCATTTTAAATTTGTATAGTGGGTTGGTGTTAGAT 
AGTTCTTACTACCTTGGTAAAATTTAAACATATCACCCAACCACAATCTA 

1651 GTGAGGCGATCGGATCCGAGCCTTAAACAAATCATTCTTTACCCTCTCCA 
CACTCCGCTAGCCTAGGCTCGGAATTTGTTTAGTAAGAAATGGGAGAGGT 

1701 TGGTGACCCAAACCAAATATGGTTACCATTATTTTGATAGACAGATTACT 
ACCACTGGGTTTGGTTTATACCAATGGTAATAAAACTATCTGTCTAATGA 

1751 CTCTTGCAGTGTGTGTGTCCTGCCATGAAAATAGATGGCTTAAATAAAAA 
GAGAACGTCACACACACAGGACGGTACTTTTATCTACCGAATTTATTTT^ 

1801 GGACATTGTAAATTTTGTAACTGAAAGGACAGCAAGTTATATCGAATTCC 
CCTGTAACATTTAAAACATTGACTTTCCTGTCXnTCAATATAGCra 

1851 TGCAG 
ACGTC 



Total number of bases is: 1 822. 
Sequence name: pAP30S 

Note: Nticleotides in bold are found within the mutant preproricin linker region. The symbol 
within the linker region designate deleted nucleotides. 
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FIGURE 6B (PI) 

Sequence of pAP308 insert 

10 20 30 40 50 

I I I I i 

1 GAATTCATGAAACCGGGAGGAAATACTATTGTAATATGGATGTATGCAGT 

CTTAAGTACTTTGGCCCTCCTTTATGATAACATTATACCTACATACGTCA 

5 1 GGCAACATGGCTTTGTTTTGGATCCACCTCAGGGTGGTCTTTCACATTAG 
CCGTTGTACCGAAACAAAACCTAGGTGGAGTCCCACCAGAAAGTGTAATC 

101 AGGATAACAACATATTCCCCAAACT^TACCCAATTATAAACTTTACCACA 
TCCTATTGTTGTATAAGGGGTTTGTTATGGGTTAATATTTGAAATGGTGT 

151 GCGGGTGCCACTGTGCAAAGCTACACAAACTTTATCAGAGCTGTTCGCGG 
CGCCCACGGTGACACGTTTCGATGTGTTTGAAATAGTCTCGACAAGCGCC 

201 TCGTTTAACAACTGGAGCTGATGTGAGACAT(3AAATACCAGTGTTGCCAA 
AGCAAATTGTTGACCTCGACTACACTCTGTACTTTATGGTCACAACGGTT 

251 ACAGAGTTGGTTTGCCTATAAACCAACGGTTTATTTTAGTTGAACTCTCA 
TGTCTCAACCAAACGGATATTTGGTTGCCAAATAAAATCAACTTGAGAGT 

301 AATCATGCAGAGCTTTCTGTTACATTAGCGCTGGATGTCACCAATGCATA 
TTAGTACGTCTCGAAAGACAATGTAATCGCGACCTACAGTGGTTACGTAT 

351 TGTGGTCGGCTACCGTGCTGGAAATAGCGCATATTTCTTTCATCCTGACA 
ACACCAGCCGATGGCACGACCTTTATCGCGTATAAAGAAAGTAGGACTGT 

401 ATCAGGAAGATGCAGAAGCAATCACTCATCTTTTCACTGATGTTCAAAAT 
TAGTCCTTCTACGTCTTCGTTAGTGAGTAGAAAAGTGACTACAAGTTTTA 

451 CGATATACATTCGCCTTTGGTGGTAATTATGATAGACTTGAACAACTTGC 
GCTATATGTAAGCGGAAACCACCATTAATACTATCTGAACTTGTTGAACG 

501 TGGTAATCTGAGAGAAAATATCGAGTTGGGAAATGGTCCACTAGAGGAGG 
ACCATTAGACTCTCTTTTATAGCTCAACCCTTTACCAGGTGATCTCCTCC 

551 CTAT.CTCAGCGCTTTATTATTACAGTACTGGTGGCACTCAGCTTCCAACT 
GATAGAGTCGCGAAATAATAATGTCATGACCACCGTGAGTCGAAGGTTGA 

601 CTGGCTCGTTCCTTTATAATTTGCATCCAAATGATTTCAGAAGCAGC^ 
GACCGAGCAAGGAAATATTAAACGTAGGTTTACTAAAGTCTTCGTCGTTC 

651 ATTCCAATATATTGAGGGAGAAATGCGCACGAGAATTAGGTAOACCGGA 
TAAGGTTATATAACTCCCTCTTTACGCGTGCTCTTAATCCATGTTGGCCT 



FIGURE 6B (P2) 



701 GATCTGCACCAGATCCTAGCGTAATTACACTTGAGAATAGTTGGGGGAGA 
CTAGACGTGGTCTAGGATCGCATTAATGTGAACTCTTATCAACCCCCTCT 

751 CTTTCCACTGCAATTCAAGAGTCTAACCAAGGAGCCTTTGCTAGTCCAAT 
GAAAGGTGACGTTAAGTTCTCAGATTGGTTCCTCGGAAACGATCAGGTTA 

801 TCAACTGCAAAGACGTAATGGTTCCAAATTCAGTGTGTACGATGTGAGTA 
AGTTGACGTTTCTGCATTACC^GGTTTAAGTCACACATGCTACACTCAT 

851 TATTAATCCCTATCATAGCTCTCATGGTGTATAGATGCGCACCTCCACCA 
ATAATTAGGGATAGTATCGAGAGTACCACATATCTACGCGTGGAGGTGGT 

901 GGTCCTCTTGGCATGTGGGGACAA 

CCAGGAGAACCGTACACCCCTGTT 

951 TGTGGTGGCGGAGGGCCCATAGTGCGTATCGTAGGTCGAAATG 

ACACCACCGCCTCCCGGGTATCACGCATAGCATCCAGCTTTAC 

1001 GTCTATGTGTTGATGTTAGGGATGGAAGATTCCACAACGGAAACGCAATA 
CAGATAOVCAACTACAATCCCTACCTTCTAAGGTGTTGCCTTTGCGTTAT 

1051 CAGTTGTGGCCATGCAAGTCTAATACAGATGCAAATCAGCTCTGGACTTT 
GTCAACACCGGTACX5TTCAGATTATGTCTACGTTTAGTCGAGACCTGAAA 

1101 GAAAAGAGACAATACTATTCGATCTAATGGAAAGTGTTTAACTACTTACG 
CTTTTCTCTGTTATGATAAGCTAGATTACCTTTCACAAATTGATGAATGC 

1151 GGTACAGTCCGGGAGTCTATGTGATGATCTATGATTGCAATACTGCTGCA 
CCATGTCAGGCCCTCAGATACACTACTAGATACTAACGTTATGACGACGT 

1201 ACTGATGCCACCCGCTGGCAAATATGGGATAATGGAACCATCATAAATCC 
TGACTACGGTGGGCGACCGTTTATACCCTATTACCTTGGTAGTATTTAGG 

1251 CAGATCTAGTCTAGTTTTAGCAGCGACATCAGGGAACAGTGGTACCACAC 
GTCTAGATCAGATCAAAATCGTCGCTGTAGTCCCTTGTCACCATGGTGTG 

1301 TTACAGTGGW^ACCAACATTTATGCCGTTAGTCAAGGTTGGCTTCCT 

AATGJCACGTTTGGTTGTAAATACGGCAATCAGTTCCAACCGAAGGATGA 

1351 AATAATACACAACCTTTTGTTACAACCATTGTTGGGCTATATGGTCT 

TTATTATGTGTTGGAAAACAATGTTGGTAACAACCCGATATACCAGACAC 

1401 CTTGCAAGCAAATAGTGGACAAGTATGGATAGAGGACTGTAGCAGTGAAA 
GAACGTTCGTtTATCACCTOTTCATACCrATCTCCTGAC^ 



FIGURE 6B (P3) 

1451 AGGCTGAACAACAGTGGGCTCTTTATGCAGATGGTTCAATACGTCCTCAG 
TCCGACTTGTTGTCACCCGAGAAATACGTCTACCAAGTTATGCAGGAGTC 

1501 CAAAACCGAGATAATTGCCTTACAAGTGATTCTAATATACGGGAAACAGT 
GTTTTGGCTCTATTAACGGAATGTTCACTAAGATTATATGCCCTTTGTCA 

1551 TGTTAAGATCCTCTCTTGTGGCCCTGCATCCTCTGGCCAACGATGGATGT 
ACAATTCTAGGAGAGAACACCGGGACGTAGGAGACCGGTTGCTACCTACA 

1601 TCAAGAATGATGGAACCATTTTAAATTTGTATAGTGGGTTGGTGTTAGAT 
AGTTCTTACTACCTTGGTAAAATTTAAACATATCACCCAACCACAATCTA 

1651 GTGAGGCGATCGGATCCGAGCCTTAAACTU^TCATTCTTTACCCTCTCCA 
CACTCCGCTAGCCTAGGCTCGGAATTTGTTTAGTAAGAAATGGGAGAGGT 

1701 TGGTGACCCAAACCAAATATGGTTACCATTATTTTGATAGACAGATTACT 
ACCACTGGGTTTGGTTTATACCAATGGTAATAAAACTATCTGTCTAATGA 

1751 CTCTTGCAGTGTGTGTGTCCTGCCATGAAAATAGATGGCTTAAATAAAAA 
GAGAACGTCACACACACAGGACGGTACTTTTATCTACCGAATTTATTTTT 

1801 GGACATTGTAAATTTTGTAACTGAAAGGACAGCAAGTTATATCGAATTCC 
CCTGTAACATTTAAAACATTGACTTTCCTGTCGTTCAATATAGCTTAAGG 

1851 TGCAG 
ACGTC 



Total number of bases is: 1822. 
Sequence name: pAP308 

Note: Nucleotides in bold are found within the mutant preproricin linker region. The symbol 
within the linker region designate deleted nucleotides. 
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FIGURE 7B (PI) 



Sequence of pAP309 insert 

10 20 30 40 50 

I I I I I 

1 GAATTCATGAAACCGGGAGGAAATACTATTGTAATATGGATGTATGCAGT 

CTTAAGTACTTTGGCCCTCCTTTATGATAACATTATACCTACATACGTCA 

5 1 GGCAACATGGCTTTGTTTTGGATCCACCTCAGGGTGGTCTTTCACATTAG 
CCGTTGTACCGAAACAAAACCTAGGTGGAGTCCCACCAGAAAGTGTAATC 

101 AGGATAACAACATATTCCCCAAACAATACCCAATTATAAACTTTACCACA 
TCCTATTGTTGTATAAGGGGTTTGTTATGGGTTAATATTTGAAATGGTGT 

151 GCGGGTGCCACTGTGCAAAGCTACACAAACTTTATCAGAGCTGTTCGCGG 
CGCCCACGGTGACACGTTTCGATGTGTTTGAAATAGTCTCGACAAGCGCC 

201 TCGTTTAACAACTGGAGCTGATGTGAGACATGAAATACCAGTGTTGCCAA 
AGCAAATTGTTGACCTCGACTAa^CTCTGTACTTTATGGTCACAACGGTT 

251 ACAGAGTTGGTTTGCCTATAAACCAACGGTTTATTTTAGTTGAACTCTCA 
TGTCTCAACCAAACGGATATTTGGTTGCCA7U\TAAAATCAACTTGAGAGT 

301 AATCATGCAGAGCTTTCTGTTACATTAGCGCTGGATGTCACCAATGCATA 
TTAGTACGTCTCGAAAGACAATGTAATCGCGACCTACAGTGGTTACGTAT 

351 TGTGGTCGGCTACCGTGCTGGAAATAGCGCATATTTCTTTCATCCTGACA 
ACACCAGCCGATGGCACGACCTTTATCGCGTATAAAGAAAGTAGGACTGT 

401 ATCAGGAAGATGCT^GAAGCAATCACTCATCTTTTCACTO 

TAGTCCTTOTACGTCTTCGTTAGTGAGTAGAAAAGTGACTACAAGTTTTA 

451 CGATATACATTCGCCTTTGGTGGTAATTATGATAGACTTGAACAACTTGC 
GCTATATGTAAGCGGAAACCACCATTAATACTATCTGAACTTGTTGAACG 

501 TGGTAATCTGAGAGAAAATATCGAGTTGGGAAATGGTCCACTAGAGGAGG 
ACCATTAGACTCTCTTTTATAGCTCAACCCTTTACCAGGTGATCTCCTCC 

551 CTATCTCAGCGCTTTATTATTACAGTACTGGTGGCACTCAGCTTCCAACT 
GATAGAGTCGCGAAATAATAATGTCATGACCACCGTGAGTCGAAGGTTGA 

601 CTGGCTCGTTCCTTTATAATTTGCATCCAAATGATTTCAGAAGCAGCAAG 
GACCGAGCAAGGAAATATTAAACGTAGGTTTACTAAAGTCTTCGTCGTTC 

651 ATTCCT^TATATTGAGGGAGAAATGCGCACGAGAATTAGGTACAACCGGA 
TAAGGTTATATAACTCCCTCTTTACGCGTGCTCTTAATCCATGTTGGCCT 



FIGURE 7B (P2) 



701 GATCTGCACCAGATCCTAGCGTAATTACACTTGAGAATAGTTGGGGGAGA 
CTAGACGTGGTCTAGGATCGCATTAATGTGAACTCTTATCT^CCCCCTCT 

751 CTTTCCACTGCAATTCAAGAGTCTAACCAAGGAGCCTTTGCTAGTCCAAT 
GAAAGGTGACGTTAAGTTCTCAGATTGGTTCCTCGGAAACGATCAGGTTA 

801 TCAACTGCAAAGACGTAATGGTTCCAAATTCAGTGTGTACGATGTGAGTA 
AGTTGACGTTTCTGCATTACCAAGGTTTAAGTCT^CACATGCTACACTCAT 

851 TATTAATCCCTATCATAGCTCTCATGGTGTATAGATGCGCACCTCCACCA 
ATAATTAGGGATAGTATCGAGAGTACCACATATCTACGCGTOOAGGTGGT 

901 TCGTCAO^GTTTGGTCCTCTTGGCATGTGGGGACAACGAAATTTTAATGC 
AGCAGTGTCAAACCAGGAGAACCGTACACCCCT6TTGCTTTAAAATTACG 

951 T6ATGTTTGTGGTGGCGGAGGGCCCATAGTGCGTATCGTAGGTCGAAATG 
ACTACAAACACCACCGCCTCCCGGGTATCACGCATAGCATCCAGCTTTAC 

1001 GTCTATGTGTTGATGTTAGGGATGGAAGATTCCACAACGGAAACGCAATA 
CAGATACACAACTACAATCCCTACCTTCTAAGGTGTTGCCTTTGCGTTAT 

1051 CAGTTGTGGCCATGCAAGTCTAATACAGATGCAAATCAGCTCTGGACTTT 
GTCAACACCGGTACGTTCAGATTATGTCTACGTTTAGTCGAGACCTGAAA 

1101 GAAAAGAGACAATACTATTCGATCTAATGGAAAGTGTTTAACTACTTACG 
CTTTTCTCTGTTATGATAAGCTAGATTACCTTTCACAAATTGATGAATGC 

1151 GGTACAGTCCGGGAGTCTATGTGATGATCTATGATTGCAATACTGCTGCA 
CCATGTCAGGCCCTCAGATACACTACTAGATACTAACGTTATGACGACGT 

1201 ACTGATGCCACCCGCTGGCAAATATGGGATAATGGAACCATCATAAATCC 
TGACTACGGTGGGCGACCGTTTATACCCTATTACCTTGGTAGTATTTAGG 

1251 CAGATCTAGTCTAGTTTTAGCAGCGACATCAGGGAACAGTGGTACCACAC 
GTCTAGATCAGATCAAAATCGTCGCTGTAGTCCCTTGTCACCATGGTGTG 

1301 TTACAGTGCAAACCAACATTTATGCCGTTAGTCAAGGTTGGCTTCCTACT 
AATGTCACGTTTGGTTGTAAATACGGCAATCAGTTCCAACCGAAGGATGA 

1351 AATAATACACAACCTTTTGTTACAACCATTGTTGGGCTATATGGTCTGTG 
TTATTATGTGTTGGAAAACAATGTTGGTAACAACCCGATATACCAGACAC 

1401 CTTGCAAGCAAATAGTGGACAAGTATGGATAGAGGACTGTAGCAGTGAAA 
GAACGTTCGTTTATCACCTGTTCATACCTATCTCCTGACATCGTCACTTT 



FIGURE 7B (P3^ 

1451 AGGCTGAACAACT^GTGGGCTCTTTATGCAGATGGTTCAATACGTCCTCAG 
TCCGACTTGTTGTCACCCGAGAAATACGTCTACCAAGTTATGCAGGAGTC 

1501 CAAAACCGAGATAATTGCCTTACAAGTGATTCTAATATACGGGAAACAGT 
GTTTTGGCTCTATTAACGGAATGTTCACTAAGATTATATGCCCTTTGTCA 

1551 TGTTAAGATCCTCTCTTGTGGCCCTGCATCCTCTGGCCAACGATGGATGT 
ACAATTCTAGGAGAGAACACCGGGACGTAGGAGACCGGTTGCTACCTACA 

1601 TCAAGAATGATGGAACCATTTTAAATTTGTATAGTGGGTTGGTGTTAGAT 
AGTTCTTACTACCTTGGTAAAATTTAAACATATCACCCAACCACAATCTA 

1651 GTGAGGCGATCGGATCCGAGCCTTAAACAAATCATTCTTTACCCTCTCCA 
CACTCCGCTAGCCTAGGCTCGGAATTTGTTTAGTAAGAAATGGGAGAGGT 

1701 TGGTGACCCAAACCAAATATGGTTACCATTATTTTGATAGACAGATTACT 
ACCACTGGGTTTGGTTTATACCAATGGTAATAAAACTATCTGTCTAATGA 

1751 CTCTTGCAGTGTGTGTGTCCTGCCATGAAAATAGATGGCTTAAATAAAAA 
GAGAACGTCACACACACAGGACGGTACITTTATCrACCGAATTTAT^ 

1801 GGACATTGTAAATTTTGTAACTGAAAGGACAGCAAGTTATATCGAATTCC 
CCTGTAACATTTAAAACATTGACTTTCCTGTCGTTaUVTATAGCnTAAGG 

1851 TGCAG 
ACGTC 



Total number of bases is: 1855. 
Sequence name: pAP309 

Note: Nucleotides in bold are found within the mutant preproricin linker region. The symbol 
within the linker region designate deleted nucleotides. 
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FIGURE 8B (PI) 



Sequence of pAP313 insert 

10 20 30 40 50 

I I I I I 

1 GAATTCATGAAACCGGGAGGAAATACTATTGTAATATGGATGTATGCAGT 

CTTAAGTACTTTGGCCCTCCTTTATGATAACATTATACCTACATACGTCA 

5 1 GGCAACATGGCTTTGTTTTGGATCCACCTCAGGGTGGTCTTTCACATTAG 
CCGTTGTACCGAAACAAAACCTAGGTGGAGTCCCACCAGAAAGTGTAATC 

101 AGGATAACAACATATTCCCCATU^CAATACCCAATTATAAACTTTACCACA 
TCCTATTGTTGTATAAGGGGTTTGTTATGGGTTAATATTTGAAATGGTGT 

151 GCGGGTGCCACTGTGCAAAGCTACACAAACTTTATCAGAGCTGTTCGCGG 
CGCCCACGGTGACACGTTTCGATGTGTTTGAAATAGTCTCGACAAGCGCC 

201 TCGTTTAACAACTGGAGCTGATGTGAGACATGAAATACCAGTGTTGCCAA 
AGCAAATTGTTGACCTCGACTACACTCTGTACTTTATGGTCACAACGGTT 

251 ACAGAGTTGGTTTGCCTATAAACCAACGGTTTATTTTAGTTGAACTCTCA 
TGTCTCAACCAAACGGATATTTGGTTGCCAAATAAAATCAACTTGAGAGT 

301 AATCATGCAGAGCTTTCTGTTACATTAGCGCTGGATGTCACCAATGCATA 
TTAGTACGTCTCGAAAGACAATGTAATCGCGACCTACAGTGGTTACGTAT 

351 TGTGGTCGGCTACCGTGCTGGAAATAGCGCATATTTCTTTCATCCTGACA 
ACACCAGCCGATGGCACGACCTTTATCGCGTATAAAGAAAGTAGGACTGT 

401 ATCAGGAAGATGGA.GAAGCAATCACTCATCTTTTCACTGATGTTCAAAAT 
TAGTCCTTCTACGTCTTCGTTAGTGAGTAGAAAAGTGACTACAAGTTTTA 

451 CGATATACATTCGCCTTTGGTGGTAATTATGATAGACTTGAACAACTTGC 
GCTATATGTAAGCGGTU^CCACCATTAATACTATCTGAACTTGTTGAACG 

501 TGGTAATCTGAGAGAAAATATCGAGTTGGGAAATGGTCCACTAGAGGAGG 
ACCy^TTAGACTCTCTTTTATAGCTCAACCCTTTACCAGGTGATCTCCTCC 

551 CTATCTCAGCGCTTTATTATTAGAGTACTGGTGGCACTCAGCTTCC:^ 

GATAGAGTCGCGAAATAATAATGTCATGACCACCGTGAGTCGAAGGTTGA 

601 CTGGCTCXSTTCCTTTATAATTTGCTITCCAAATGATTTCAG^ 

GACCGAGCAAGGAAATATTAAACGTAGGTTTACTAAAGTCTTCGTCGTTC 

651 ATTCCTU^TATATTGAGGGAGAAATGCGCACGAGAATTAGGTACAACCGGA 
TAAGGTTATATAACTCCCTCTTTACGCGTGCTCTTAATCCATGTTGGCCT 



FIGURE 8B (P2) 



701 GATCTGCACCAGATCCTAGCGTAATTACACTTGAGAATAGTTGGGGGAGA 
CTAGACGTGGTCTAGGATCGCATTAATGTGAACTCTTATCAACCCCCTCT 

751 CTTTCCACTGCAATTCAAGAGTCTAACCAAGGAGCCTTTGCTAGTCCAAT 
GAAAGGTGACGTTAAGTTCTCAGATTGGTTCCTCGGAAACGATCAGGTTA 

801 TCAACTGCAAAGACGTAATGGTTCCAAATTCAGTGTGTACGATGTGAGTA 
AGTTGACGTTTCTGCATTACCAAGGTTTAAGTCACACATGCTACACTCAT 

851 TATTAATCCCTATCATAGCTCTCATGGTGTATAGATGC 

ATAATTAGGGATAGTATCGAGAGTACCACATATCTACG 

901 CCAGGACGAGTAGTCGGCGGG 

GGTCCTGCTCATCAGCCGCCC 

951 TGTATGGATCCTGAGCCCATAGTGCGTATCGTAGGTCGAAATG 

ACATACCTAGGACTCGGGTATCACGCATAGCATCCAGCTTTAC 

1001 GTCTATGTGTTGATGTTAGGGATGGAAGATTCCACAACGGAAACGCAATA 
CAGATACACAACTACAATCCCTACCTTCTAAGGTGTTGCCTTTGCGTTAT 

1051 CAGTTGTGGCCATGCAAGTCTAATACAGATGCaAATCAGCTCTGGACTTT 
GTCAACACCGGTACGTTCAGATTATGTCTACGTTTAGTCGAGACCTGAAA 

1101 GAAAAGAGACAATACTATTCGATCTAATGGAAAGTGTTTAACTACTTACG 
CTTTTCTCTGTTATGATAAGCTAGATTACCTTTCACAAATTGATGAATGC 

1151 GGTACAGTCCGGGAGTCTATGTGATGATCTATGATTGCAATACTGCTGCA 
CCATGTCAGGCCCTCAGATACACTACTAGATACTAACGTTATGACGACGT 

1201 ACTGATGCCACCCGCTGGCAAATATGGGATAATGGAACCATCATAAATCC 
TGACTACGGTGGGCGACCGTTTATACCCTATTACCTTGGTAGTATTTAGG 

1251 CAGATCTAGTCTAGTTTTAGCAGCGACATCAGGGAACAGTGGTACCACAC 
GTCTAGATCAGATCAAAATCGTCGCTGTAGTCCCTTGTCACCATGGTGTG 

1301 TTACAGTGCAAACCAACATTTATGCCGTTAGTCAAGGTTGGCTTCCTACT 
AATGJCACGTTTGGTTGTAAATACGGCAATCAGTTCCAACCGAAGGATGA 

1351 AATAATAGACAACCTTTTGTTACAACCy^TTGTTGGGCTATATGGTC^ 

TTATTATGTGTTGGAAAACAATGTTGGTAACAACCCGATATACCAGACAC 

1401 CTTGCAAGCAAATAGTGGACAAGTATGGATAGAGGACTGTAGCAGTGAAA 
GAACGTTCGTTTATCy^CCTGTTCATACCTATCTCCTGACATCGTCACT^ 



FIGURE SB (P3) 

1451 AGGCTGAACAACAGTGGGCTCTTTATGCAGATGGTTCAATACGTCCTCAG 
TCCGACTTGTTGTCACCCGAGAAATACGTCTACCAAGTTATGCAGGAGTC 

1501 CAAAACCGAGATAATTGCCTTACAAGTGATTCTAATATACGGGAAACAGT 
GTTTTGGCTCTATTAACGGAATGTTCACTAAGATTATATGCCCTTTGTCA 

1551 TGTTAAGATCCTCTCTTGTGGCCCTGCATCCTCTGGCCAACGATGGATGT 
ACAATTCTAGGAGAGAACACCGGGACGTAGGAGACCGGTTGCTACCTACA 

1601 TCAAGAATGATGGAACCATTTTAAATTTGTATAGTGGGTTGGTGTTAGAT 
AGTTCTTACTACCTTGGTAAT^TTTAAACATATCACCCAACCACAATCTA 

1651 GTGAGGCGATCGGATCCGAGCCTTAAACAAATCATTCTTTACCCTCTCCA 
CACTCCGCTAGCCTAGGCTCGGAATTTGTTTAGTAAGAAATGGGAGAGGT 

1701 TGGTGACCCAAACCAAATATGGTTACCATTATTTTGATAGACAGATTACT 
ACCACTGGGTTTGGTTTATACCAATGGTAATAAAACTATCTGTCTAATGA 

1751 CTCTTGCAGTGTGTGTGTCCTGCCATGAAAATAGATGGCTTAAATAAAAA 
GAGAACGTCT^CACACACAGGACGGTACTTTTATCTACCGAATTTATTTTT 

1801 GGACATTGTAAATTTTGTAACTGAAAGGACAGCAAGTTATATCGAATTCC 
CCTGTAACATTTAAAACATTGACTTTCCTGTCGTTCAATATAGCTTAAGG 

1851 TGCAG 
ACGTC 



Total number of bases is: 1807. 
Sequence name: pAP3 13 

Note: Nucleotides in bold are found within the mutant preproricin linker region. The symbol 
within the linker region designate deleted nucleotides. 
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FIGURE 9B (Fl^ 

Sequence of pAP3 14 insert 

10 20 30 40 50 

I I I I i 

1 GAATTCATGAAACCGGGAGGAAATACTATTGTAATATGGATGTATGCAGT 
CTTAAGTACTTTGGCCCTCCTTTATGATAACATTATACCTACATACGTCA 

5 1 GGCAACATGGCTTTGTTTTGGATCCACCTCAGGGTGGTCTTTCACATTAG 
CCGTTGTACCGAAACAAAACCTAGGTGGAGTCCCACCAGAAAGTGTAATC 

101 AGGATAACAACATATTCCCCT^AACAATACCCAATTATAAACTTTACCACA 
TCCTATTGTTGTATAAGGGGTTTGTTATGGGTTAATATTTGAAATGGTGT 

151 GCGGGTGCCACTGTGCAAAGCTACACAAACTTTATCAGAGCTGTTCGCGG 
CGCCCT^CGGTGACACGTTTCGATGTGTTTGAAATAGTCTCGACAAGCGCC 

2 01 TCGTTTAACAACTGGAGCTGATGTGAGACATGAAATACCAGTGTTGCCAA 
AGCAAATTGTTGACCTCGACTACACTCTGTACTTTATGGTCAC7ACGGTT 

251 ACAGAGTTGGTTTGCCTATAAACCAACGGTTTATTTTAGTTGAACTCTCA 
TGTCTCAACCAAACGGATATTTGGTTGCOUIATAAAATCAACTTGAGAGT 

301 AATCATGCAGAGCTTTCTGTTACATTAGCGCTGGATGTCACCT^TGCATA 
TTAGTACGTCTCGAAAGACAATGTAATCGCGACCTACAGTGGTTACX5TAT 

351 TGTGGTCGGCTACCGTGCTGGAAATAGCGCATATTTCTTTCATCCTGACA 
ACACCAGCCGATGGCACGACCTTTATCGCGTATAAAGAAAGTAG6ACTGT 

401 ATCAGGAAGATGCAGAAGCAATCACTCATCTTTTCACTGATGTTCAAAAT 
TAGTCCTTCTACGTCTTCGTTAGT6AGTAGAAAAGTGACTACAAGTTTTA 

451 CGATATACATTCGCCTTTGGTGGTAATTATGATAGACTTGAACAACTTGC 
GCTATATGTAAGCGGAAACCACCATTAATACTATCTGAACTTGTTGAACG 

501 TGGTAATCTGAGAGAAAATATCGAGTTGGGAAATGGTCCACTAGAGGAGG 
ACaVTTAGACTCTCTTTTATAGCTCAACCCTTTACCAGGTGATCTCCTCC 

551 CTATCTCAGCGCTTTATTATTACAGTACTGGTGGCACTCAGCTTCCAACT 
GATAGAGTCGCGAAATAATAATGTCATGACCACCGTGAGTCGAAGGTTGA 

601 CTGGCTCGTTCCTTTATAATTTGCATCCAAATGATTTCAGAAGCAGCAAG 
GACCGAGCAAGGAAATATTAAACGTAGGCTTAGTitAAGO:!eTTCGTCGTTC 

651 ATTCCAATATATTGAGGGAGAAATGCGCACGAGAATTAGGTACAACGGGA 
TAAGGTTATATAACTCCCTCTTTACGCGTGCTCTTAATCCATGTTGGCCT 



FIGURE 9B (P2) 



701 GATCTGCACCAGATCCTAGCGTAATTACACTTGAGAATAGTTGGGGGAGA 
CTAGACGTGGTCTAGGATCGCATTAATGTGAACTCTTATCAACCCCCTCT 

751 CTTTCCACTGCAATTCAAGAGTCTAACCAAGGAGCCTTTGCTAGTCCAAT 
GAAAGGTGAC6TTAAGTTCTCAGATTGGTTCCTCGGAAACGATCAGGTTA 

801 TCAACTGCAAAGACX3TAATGGTTCCAAATTCAGTGTGTACGATGTGAGTA 
AGTTGACGTTTCTGCATTACCAAGGTTTAAGTCACACATGCTACZACTCAT 

851 TATTAATCCCTATCATAGCTCTCATGGTGTATAGATGCGGAGGCGGGGGT 
ATAATTAGGGATAGTATCGAGAGTACCACATATCTACGCCTCCGCCCCCA 

901 CCAGGACGAGTAGTCGGCGGG GGAG6 

GGTCCTGCTCATCAGCCGCCC CCTCC 

951 CGGGGGTTGTATGGATCCTGAGCCCATAGTGCGTATCGTAGGTCGAAATG 
GCCCCCAACATACCTAGGACTCGGGTATCACGCATAGCATCCAGCTTTAC 

1001 GTCTATGTGTTGATGTTAGGGATGGAAGATTCCACAACGGAAACGCAATA 
CAGATACACAACTACAATCCCTACCTTCTAAGGTGTTGCCTTTGCGTTAT 

1051 CAGTTGTGGCCATGCAAGTCTAATACAGATGCAAATCAGCTCTGGACTTT 
GTCAACACCGGTACGTTCAGATTATGTCTACGTTTAGTCGAGACCTGAAA 

1101 GAAAAGAGACAATACTATTCGATCTAATGGAAAGTGTTTAACTACTTACG 
CTTTTCTCTGTTATGATAAGCTAGATTACCTTTCACAAATTGATGAATGC 

1151 GGTACAGTCCGGGAGTCTATGTGATGATCTATGATTGCAATACTGCTGCA 
CCATGTCAGGCCCTCAGATACACTACTAGATACTAACGTTATGACGACGT 

1201 ACTGATGCCACCCGCTGGCAAATATGGGATAATGGAACCATCATAAATCC 
TGACTACGGTGGGCGACCGTTTATACCCTATTACCTTGGTAGTATTTAGG 

1251 CAGATCTAGTCTAGTTTTAGCAGCGACATCAGGGAACAGTGGTACCACAC 
GTCTAGATCAGATCAAAATCGTCGCTGTAGTCCCTTGTCACCATGGTGTG 

1301 TTACAGTGCAAACCAACATTTATGCCGTTAGTCAAGGTTGGCTTCCTACT 
AATGTCACGTTTGGTTGTAAATACGGCAATCAGTTCCAACCGAAGGATGA 

1351 AATAATACACAACCTTTTGTTACAACCATTGTTGGGCTATATGGTCTGTG 
TTATTATGTGTTGGAAAACAATGTTGGTAACAACCCGATATACCAGACAC 

14 01 CTTGCAAGCAAATAGTGGACAAGTATGGATAGAGGACTGTAGCAGTGAAA 
GAACGTTCGTTTATCACCTGTTCATACCTATCTCCTGACATCGTCACTTT 



FIGURE 9B rP3^ 

1451 AGGCTGAACAACAGTGGGCTCTTTATGCAGATGGTTCAATACGTCCTCAG 
TCCGACTTGTTGTCACCCGAGAAATACGTCTACCAAGTTATGCAGGAGTC 

1501 CAAAACCGAGATAATTGCCTTACAAGTGATTCTAATATACGGGAAACAGT 
GTTTTGGCTCTATTAACGGAATGTTCACTAAGATTATATGCCCTTTGTCA 

1551 TGTTAAGATCCTCTCTTGTGGCCCTGCATCCTCTGGCCAACGATGGATGT 
ACAATTCTAGGAGAGAACACCGGGACGTAGGAGACCGGTTGCTACCTACA 

1601 TCAAGAATGATGGAACCATTTTAAATTTGTATAGTGGGTTGGTGTTAGAT 
AGTTCTTACTACCTTGGTAAAATTTAAACATATCACCCAACCACy^AT 

1651 GTGAGGCGATCGGATCCGAGCCTTAAACAAATCATTCTTTACCCTCTCCA 
CACTCCGCTAGCCTAGGCTCGGAATTTGTTTAGTAAGAAATGGGAGAGGT 

1701 TGGTGACCCAAACCAAATATGGTTACCATTATTTTGATAGACAGATTACT 
ACCACTGGGTTTGGTTTATACCAATGGTAATAAAACTATCTGTCTAATGA 

1751 CTCTTGCAGTGTGTGTGTCCTGCCATGAAAATAGATGGCTTAAATAAAAA 
GAGAACGTCACACACACAGGACGGTACTTTTATCTACCGAATTTATTTTT 

1801 GGACATTGTAAATTTTGTAACTGAAA6GACAGCAAGTTATATCGAATTCC 
CCTGTAACaLTTTAAAAGATTGACTTTCCTGTCGTTa^TATAGCTTAAGG 

1851 TGCAG 
ACGTC 



Total number of bases is: 1 83 1 . 
Sequence name: pAP314 

Note: Nucleotides in bold are found within the mutant preproricin linker region. The *-* symbol 
within the linker region designate deleted nucleotides. 
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FIGURE lOB (PI) 



Sequence of pAP3 1 5 insert 

10 20 30 40 50 

I I I I I 

1 GAATTCATGAAACCGGGAGGAAATACTATTGTAATATGGATGTATGCAGT 

CTTAAGTACTTTGGCCCTCCTTTATGATAACATTATACCTACATACGTCA 

5 1 GGCAACATGGCTTTGTTTTGGATCCACCTCAGGGTGGTCTTTCACATTAG 
CCGTTGTACCGAAACAAAACCTAGGTGGAGTCCCACCAGAAAGTGTAATC 

101 AGGATAACAACATATTCCCCAAACAATACCCAATTATAAACTTTACCACA 
TCCTATTGTTGTATAAGGGGTTTGTTATGGGTTAATATTTGAAATGGTGT 

151 GCGGGTGCCACTGTGC AAAGCTACACAAACTTTATCAGAGCTGTTCGCGG 
CGCCCACGGTGACACGTTTCGATGTGTTTGAAATAGTCTCGACAAGCGCC 

201 TCGTTTAACAACTGGAGCTGATGTGAGACATGAAATACCAGTGTTGCCAA 
AGCAAATTGTTGACCTCGACTACACTCTGTACTTTATGGTCACAACGGTT 

251 ACAGAGTTGGTTTGCCTATAAACCAACGGTTTATTTTAGTTGAACTCTCA 
TGTCTCAACCAAACGGATATTTGGTTGCCAAATAAAATCAACTTGAGAGT 

301 AATCATGCAGAGCTTTCTGTTACATTAGCGCTGGATGTCACCAATGCATA 
TTAGTACGTCTCGAAAGACAATGTAATCGCGACCTACAGTGGTTACGTAT 

351 TGTGGTCGGCTACCGTGCTGGAAATAGCGCATATTTCTTTCATCCTGACA 
ACACCAGCCGATGGCACGACCTTTATCGCGTATAAAGAAAGTAGGACTGT 

401 ATCAGGAAGATGCAGAAGCAATCACTCATCTTTTCACTGATGTTCAAAAT 
TAGTCCTTCTACGTCTTCGTTAGTGAGTAGAAAAGTGACTACAAGTTTTA 

451 CGATATACATTCGCCTTTGGTGGTAATTATGATAGACTTGAACAACTTGC 
GCTATATGTAAGCXSGAT^CCTVCCATTAATACrATCTGAACTTGTTGAACG 

501 TGGTAATCTGAGAGAAAATATCGAGTTGGGAAATGGTCCACTAGAGGAGG 
ACCATTAGACTCTCTTTTATAGCTCAACCCTTTACCAGGTGATCTCCTCC 

551 CTATCTCAGCGCTTTATTATTACAGTACTGGTGGCACTCAGCTTCCAACT 
GATAGAGTCGCGAAATAATAATGTCATGACCACCGTGAGTCGAAGGTTGA 

601 CTGGCTCGTTCCTTTATAATTTGCATCCAAATGATTTCAGAAGCAGCAAG 
GACCGAGCAAGGAAATATTAAACGTAGGTTTACTAAAGTCTTCGTCGTTC 

651 ATTCCAATATATTGAGGGAGAAATGCGCACGAGAATTAGGTACAACCGGA 
TAAGGTTATATAACTCCCTCTTTACGCGTGCTCTTAATCCATGTTGGCCT 



FIGURE lOB (P2) 



701 GATCTGCACCAGATCCTAGCGTAATTACACTTGAGAATAGTTGGGGGAGA 
CTAGACGTGGTCTAGGATCGCATTAATGTGAACTCTTATCAACCCCCTCT 

751 CTTTCCACTGCAATTCAAGAGTCTAACCAAGGAGCCTTTGCTAGTCCAAT - 
GAAAGGTGACGTTAAGTTCTCAGATTGGTTCCTCGGAAACGATCAGGTTA 

801 TCAACTGCAAAGACGTAATGGTTCCAAATTCAGTGTGTACGATGTGAGTA 
AGTTGACGTTTCTGCATTACCAAGGTTTAAGTCACACATGCTACACTCAT 

851 TATTAATCCCTATCATAGCTCTCATGGTGTATAGATGC 

ATAATTAGGGATAGTATCGAGAGTACCACATATCTACG 

901 CCAG6ACGAGTAGTCGGCGGGCCAGGACGAGTAGTCGGCGGG 

GGTCCTGCTCATCAGCCGCCC6GTCCTGCTCATCAGCCGCCC 

951 TGTATGGATCCTGAGCCCATAGTGCGTATCGTAGGTCGAAATG 

ACATACCTAGGACTCGGGTATCACGCATAGCATCCAGCTTTAC 

1001 GTCTATGTGTTGATGTTAGGGATGGAAGATTCCACAACGG7UVACGCAATA 
CAGATACACAACTACAATCCCTACCTTCTAAGGTGTTGCCTTTGCGTTAT 

1051 CAGTTGTGGCCATGOf^GTCTAATACAGATGCAAATCAGCTCTGGACTTT 
GTCAACACCGGTACGTTCAGATTATGTCTACGTTTAGTCGAGACCTGAAA 

1101 GAAAAGAGACAATACTATTCGATCTAATGGAAAGTGTTTAACTACTTACG 
CTTTTCTCTGTTATGATAAGCTAGATTACCTTTCACAAATTGATGAATGC 

1151 GGTACAGTCCGGGAGTCTATGTGATGATCTATGATTGCAATACTGCTGCA 
CCATGTCAGGCCCTCAGATACACTACTAGATACTAACGTTATGACGACGT 

1201 ACTGATGCCACCCGCTGGCAAATATGGGATAATGGAACCATCATAAATCC 
TGACTACGGTGGGCGACCGTTTATACCCTATTACCTTGGTAGTATTTAGG 

1251 CAGATCTAGTCTAGTTTTAGCAGCGACATCAGGGAACAGTGGTACCACAC 
GTCTAGATCAGATCAAAATCGTCGCTGTAGTCCCTTGTCACCATGGTGTG 

1301 TTACAGTGCAAACCAACATTTATGCCGTTAGTCAAGGTTGGCTTCCTACT 
AATGTCACGTTTGGTTGTAAATACGGCAATCAGTTCCAACCGAAGGATGA 

1351 AATAATACACAACCTTTTGTTACAACCATTGTTGGGCTATATGGTCTGTG 
TTATTATGTGTTGGAAAACAATGTTGGTAACAACCCGATATACCAGACAC 

1401 CTTGCAAGCAAATAGTGGACAAGTATGGATAGAGGACTGTAGCAGTGAAA 
GAACGTTCGTTTATCACCTGTtCATACCTATCTCCTGACATCGT^ 



FIGURE lOB rP3) 

1451 AGGCTGTU^CAACAGTGGGCTCTTTATGCAGATGGTTCAATACGTCCTCAG 
TCCGACTTGTTGTCACCCGAGAAATACGTCTACCAAGTTATGCAGGAGTC 

1501 CAAAACCGAGATAATTGCCTTACAAGTGATTCTAATATACGGGAAACAGT 
GTTTTGGCTCTATTAACGGAATGTTCACTAAGATTATATGCCCTTTGTCA 

1551 TGTTAAGATCCTCTCTTGTGGCCCTGCATCCTCTGGCCAACGATGGATGT 
ACAATTCTAGGAGAGAACACCGGGACGTAGGAGACCGGTTGCTACCTACA 

1601 TCAAGAATGATGGAACCATTTTAAATTTGTATAGTGGGTTGGTGTTAGAT 
AGTTCTTACTACCTTGGTAAAATTTAAACATATCACCCAACCACAATCTA 

1651 GTGAGGCGATCGGATCCGAGCCTTAAACAAATCATTCTTTACCCTCTCCA 
CACTCCGCTAGCCTAGGCTCGGAATTTGTTTAGTAAGAAATGGGAGAGGT 

1701 TGGTGACCCT^AACCAAATATGGTTACCATTATTTTGATAGACAGATTACT 
ACCACTGGGTTTGGTTTATACCAATGGTAATAAAACTATCTGTCTAATGA 

1751 CTCTTGCAGTGTGTGTGTCCTGCCATGAAAATAGATGGCTTAAATAAAAA 
GAGAACGTCACACACACAGGACGGTACTTTTATCTACCGAATTTATTTTT 

1801 GGACATTGTAAATTTTGTAACTGAAAGGACAGCAAGTTATATCGAATTCC 
CCTGTAACATTTAAAACATTGACTTTCCTGTCGTTCAATATAGCTTAAGG 

1851 TGCAG 
ACGTC 



Total number of bases is: 1828. 
Sequence name: pAP3 15 



Note: Nucleotides in bold are found within the mutant preproricin linker region. The *-* 
\sdthin the linker region designate deleted nucleotides. 
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FIGURE IIB fPl) 



Sequence of pAP3 1 6 insert 

10 20 30 40 50 

I I I I I 

1 GAATTCATGAAACCGGGAGGAAATACTATTGTAATATGGATGTATGCAGT 

CTTAAGTACTTTGGCCCTCCTTTATGATAACATTATACCTACATACGTCA 

5 1 GGCAACATGGCTTTGTTTTGGATCCACCTCAGGGTGGTCTTTCACATTAG 
CCGTTGTACCGAAACAAAACCTAGGTGGAGTCCCACCAGAAAGTGTAATC 

101 AGGATAACTIACATATTCCCCAAACAATACCCAATTATAAACTTTACCAC^ 
TCCTATTGTTGTATAAGGGGTTTGTTATGGGTTTATATTTGAT^TGGTGT 

151 GCGGGTGCCACTGTGCAAAGCTACACAAACTTTATCAGAGCTGTTCGCGG 
CGCCCACGGTGACACGTTTCGATGTGTTTGAAATAGTCTCGACAAGCGCC 

201 TCGTTTAACAACTGGAGCTGATGTGAGACATGAAATACCAGTGTTGCCAA 
AGCAAATTGTTGACCTCGACTACACTCrGTACITTATGGTCACAACGGTT 

251 ACAGAGTTGGTTTGCCTATAAACCAACGGTTTATTTTAGTTGAACTCTCA 
TGTCTCAACCAAACGGATATTTGGTTGCCaAATAAAATCAACTTGAGAGT 

3 01 AATCATGCAGAGCTTTCTGTTACATTAGCGCTGGATGTCACCAATGCATA 
TTAGTACGTCTCGAAAGACAATGTAATCGCGACCTACAGTGGTTACGTAT 

351 TGTGGTCGGCTACCGTGCTGGAAATAGCGCATATTTCTTTCATCCTGACA 
ACACCAGCCGATGGCACGACCTTTATCGCGTATAAAGAAAGTAGGACTGT 

401 ATCAGGAAGATGCAGAAGCAATCACTCATCTTTTCACTGATGTTCA^ 

TAGTCCTTCTACGTCTTCGTTAGTGAGTAGAAAAGTGACTACAAGTTTTA 

451 CGATATACATTCGCCTTTGGTGGTAATTATGATAGACTTGAACAACTTGC 
GCTATATGTAAGCGGAAACCACCATTAATACTATCTGAACTTGTTGAACG 

501 TGGTAATCTGAGAGAAAATATCGAGTTGGGAAATGGTCCACTAGAGGAGG 
ACCATTAGACTCTCTTTTATAGCTCAACCCTTTACCAGGTGATCTCCTCC 

551 CTATCTCAGCGCTTTATTATTACAGTACTGGTGGCACTCAGCTTCCAACT 
GATAGAGTCGCGAAATAATAATGTCATGACCACCGTGAGTCGAAGGTTGA 

601 CTGGCTCGTTCCTTTATAATTTGCATCCAAATGATTTCAGAAGCAGCAAG 
GACCGAGCAAGGAAATATTAAACGTAGGTTTACTAAAGTCTTCGTCGTTC 

651 ATTCCAATATATTGAGGGAGAAATGCGCACGAGAATTAGGTACAACCGGA 
TAAGGTTATATAACTCCCTCTTTACGCGTGCTCTTAATCCATGTTGGCCT 



FIGURE IIB (P2) 



701 GATCTGCACCAGATCCTAGCGTAATTACACTTGAGAATAGTTGGGGGAGA 
CTAGACGTGGTCTAGGATCGCATTAATGTGAACTCTTATCAACCCCCTCT 

751 CTTTCCACTGCAATTCAAGAGTCTAACCAAGGAGCCTTTGCTAGTCCAAT 
GAAAGGTGACGTTAAGTTCTCAGATTGGTTCCTCGGAAACGATCAGGTTA 

801 TCAACTGCAAAGACGTAATGGTTCCAAATTCAGTGTGTACGATGTGAGTA 
AGTTGACGTTTCTGCATTACa^GGTTTAAGTCACACATGCTACACTCAT 

851 TATTAATCCCTATCATAGCTCTCATGGTGTATAGATGCGGAGGCGGGGGT 
ATAATTAGGGATAGTATCGAGAGTACCACATATCTACGCCTCCGCCCTGA 

901 G6A6GCGGGGGTCCGCAA6GAATTGCAGG6CA66GAG06G6TAGTAGCG0 
GGTCCGCCCCCAGGCGTTCCTTAACGTCCCOTCCCTCCCCCATCATCGCC 

951 CGGGGGATGTATGGATCCTGAGCCCATAGT6CGTATCGTAGGTCGAAATG 
GCCCCCTACATACCTAGGACTCGGGTATCACGCATAGCATCCAGCTTT^^ 

1001 GTCTATGTGTTGATGTTAGGGATGGAAGATTCCACAACGGAAACGCAATA 
CAGATACACa^CTACAATCCCTACCTTCTAAGGTGTTGCCTTTGCGTTAT 

1051 CAGTTGTGGCCATGCAAGTCTAATACAGATGCAAATCAGCTCTGGACTTT 
GTCAACACCGGTACGTTCAGATTATGTCTACGTTTAGTCGAGACCTGAAA 

1101 GAAAAGAGACAAT ACTATTCGATCTAATGGAT^GTGTTTAACTACTTACG 
CTTTTCTCTGTTATGATAAGCTAGATTACCTTTCACAAATTGATGAATGC 

1151 GGTACAGTCCGGGAGTCTATGTGATGATCTATGATTGCAATACTGCTGCA 
CCATGTCAGGCCCTCAGATACACTACTAGATACTAACGTTATGACGACGT 

1201 ACTGATGCCACCCGCTGGCAAATATGGGATAATGGAACCATCATAAATCC 
TGACTACGGTGGGCGACCGTTTATACCCTATTACCTTGGTAGTATTTAGG 

1251 CAGATCTAGTCTAGTTTTAGCAGCGACATCAGGGAACAGTGGTACCACAC 
GTCTAGATCAGATCAAAATCGTCGCTGTAGTCCCTTGTCACCATGGTGTG 

13 01 TTACAGTGCAAACCAACATTTATGCCGTTAGTCAAGGTTGGCTTCCTACT 

AATGTCACGTTTGGTTGTAAATACGGCAATCAGTTCCAACCGAAGGATGA 

1351 AATAATACACAACCTTTTGTTACAACCATTGTTGGGCTATATGGTCTGTG 
TTATTATGTGTTGGAAAACAATGTTGGTAACAACCCGATATACCAGACAC 

14 01 CTTGCAAGCAAATAGTGGACAAGTATGGATAGAGGACTGTAGCAGTGAAA 

GAACGTTCGTTTATCACCTGTTCATACCTATCTCCTGACATCGTCACTTT 



FIGURE 1 IB (P3) 

1451 AGGCTGAACAACAGTGGGCTCTTTATGCAGATGGTTCAATACGTCCTCAG 
TCCGACTTGTTGTCACCCGAGAAATACGTCTACCAAGTTATGCAGGAGTC 

1501 CAAAACCGAGATAATTGCCTTACAAGTGATTCTAATATACGGGAAACAGT 
GTTTTGGCTCTATTAACGGAATGTTCACTAAGATTATATGCCCTTTGTCA 

1551 TGTTAAGATCCTCTCTTGTGGCCCTGCATCCTCTGGCCAACGATGGATGT 
ACAATTCTAGGAGAGAACACCGGGACGTAGGAGACCGGTTGCTACCTACA 

1601 TCAAGAATGATGGAACCATTTTAAATTTGTATAGTGGGTTGGTGTTAGAT 
AGTTCTTACTACCITGGTAAAATTTAAACATATCACCa^CCAC^ 

1651 GTGAGGCGATCGGATCCGAGCCTTAAACAAATCATTCTTTACCCTCTCCA 
CACTCCGCTAGCCTAGGCTCGGAATTTGTTTAGTAAGAAATGGGAGAGGT 

1701 TGGTGACCCAAACCAAATATGGTTACCATTATTTTGATAGACAGATTACT 
AC<^CrGGGTTTGGTTTATACCyUVTGGTAATAAAACTATCTGTCrAAT(^ 

1751 CTCTTGCAGTGTGTGTGTCCTGCCATGAAAATAGATGGCTTAAATAAAAA 
GAGAACGTCACACACAOIGGACGGTACTTTTATCTACCGAATTTAT^ 

1801 GGACATTGTAAATTTTGTAACTGAAAGGACAGCAAGTTATATCGAATTCC 
CCTGTAACATTTAAAACATTGACTTTCCTGTCGTTCAATATAGCTTAAGG 

1851 TGCAG 
ACGTC 



Total number of bases is: 18SS. 
Sequence name: pAP316 

Note: Nucleotides in bold are found within the mutant preproricin linker region. The symbol 
within the linker region designate deleted nucleotides. 
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FIGURE 12B (PI) 



Sequence of pAP317 insert 

10 20 30 40 50 

I I I I I 

1 GAATTCATGAAACCGGGAGGAAATACTATTGTAATATGGATGTATGCAGT 
CTTAAGTACTTTGGCCCTCCTTTATGATAACATTATACCTACATACGTCA 

5 1 GGCAACATGGCTTTGTTTTGGATCCACCTCAGGGTGGTCTTTCACATTAG 
CCGTTGTACCGAAACAAAACCTAGGTGGAGTCCCACCAGAAAGTGTAATC 

101 AGGATAACAACATATTCCCCAAACAATACCCAATTATAAACTTTACCACA 
TCCTATTGTTGTATAAGGGGTTTGTTATGGGTTAATATTTGAAATGGTGT 

151 GCGGGTGCCACTGTGCAAAGCTACACAAACTTTATCAGAGCTGTTCGCGG 
CGCCCACGGTGACACGTTTCGATGTGTTTGAAATAGTCTCGACAAGCGCC 

201 TCGTTTAACAACTGGAGCTGATGTGAGACATGAAATACCAGTGTTGCCAA 
AGCAAATTGTTGACCTCGACTACACTCTGTACTTTATGGTCACAACGGTT 

251 ACAGAGTTGGTTTGCCTATAAACCAACGGTTTATTTTAGTTGAACTCTCA 
TGTCTCAACCAAACGGATATTTGGTTGCCAAATAAAATCAACTTGAGAGT 

3 01 AATCATGCAGAGCTTTCTGTTACATTAGCGCTGGATGTCACCAATGCATA 
TTAGTACGTCTCGAAAGACAATGTAATCGCGACCTACAGTGGTTACGTAT 

351 TGTGGTCGGCTACCGTGCTGGAAATAGCGCATATTTCTTTCATCCTGACA 
AGACCAGCCGATGGCACGACCTTTATCGCGTATTVAAGAAAGTAGGACTGT 

401 ATCAGGAAGATGCAGAAGCAATCACTCATCTTTTCACTGATGTTCAAAAT 
TAGTCCTTCTACGTCTTCGTTAGTGAGTAGAAAAGTGACTACAAGTTTTA 

451 CGATATACATTCGCCTTTGGTGGTAATTATGATAGACTTGAACAACTTGC 
GCTATATGTAAGCGGAAACCACCATTAATACTATCTGAACTTGTTGAACG 

501 TGGTAATCTGAGAGAAAATATCGAGTTGGGAAATGGTCCACTAGAGGAGG 
ACCATTAGACTCTCTTTTATAGCTCAACCCTTTACCAGGTGATCTCCTCC 

551 CTATCTCAGCGCrTTATTATTACAGTACTGGTGGCACTCAG^ 

GATAGAGTCGCGAAATAATAATGTCATGACCACCGTGAGTCGAAGGTTGA 

601 CTGGCTCGTTCCTTTATAATTTGCATCCAAATGATTTCAGAAGCAGCAAG 
GACCGAGCAAGGAAATATTAAACGTAGGTTTACTAAAGTCTTCGTCGTTC 

651 ATTCCAATATATTGAGGGAGAAATGCGCACGAGAATTAGGTACAACCGGA 
TAAGGTTATATAACTCCCTCTTTACGCGTGCTCTTAATCCATGTTGGCCT 



FIGURE 12B nP2^ 

701 G ATCTGCACCAGATCCTAGCGTAATTACACTTGAGAATAGTTGGGGGAGA 
CTAGACGTGGTCTAGGATCGCATTAATGTGAACTCTTATCAACCCCCTCT 

751 CTTTCCACTGCTVATTCAAGAGTCTAACCAAGGAGCCTTTGCTAGTCC^ 
GAAAGGTGACGTTAAGTTCTCAGATTGGTTCCTCGGAAACGATCAGGTTA 

801 TCAACTGCAAAGACGTAATGGTTCCAAATTCAGTGTGTACGATGTGAGTA 
AGTTGACGTTTCTGCATTACCAAGGTTTAAGTCACACATGCTACACTCAT 

851 TATTAATCCCTATCATAGCTCTCATGGTGTATAGATGCGCACCTCCACCA 
ATAATTAGGGATAGTATCGAGAGTACCACATATCTACGCGTGGAGGTGGT 

901 TCGTCACAGTTTGTTTCGCAGAACTATCCAATAGTGCAAAATTTTACAGC 
AGCAGTGTCAAACAAAGCGTCTTGATAGGTTATCACGTTTTAAAATGTCG 

951 TGATGTTTGTATGGATCCTGAGCCCATAGTGCGTATCGTAGGTCGAAATG 
ACTACAAACATACCTAGGACTCGGGTATCACGCATAGCATCCAGCTTTAC 

1001 GTCTATGTGTTGATGTTAGGGATGGAAGATTCCACAACGGAAACGCAATA 
CAGATACACAACTACAATCCCTACCTTCTAAGGTGTTGCCTTTGCGTTAT 

1051 CAGTTGTGGCCATGCyVAGTCTAATACAGATGCAAATCAGCTCTGGACTTT 
GTCAACACCGGTACGTTCAGATTATGTCTACGTTTAGTCGAGACCTGAAA 

1101 GAAAAGAGACAATACTATTCGATCTAATGGAAAGTGTTTAACTACTTACG 
CTTTTCTCTGTTATGATAAGCTAGATTACCTTTCACAAATTGATGAATGC 

1151 GGTACAGTCCGGGAGTCTATGTGATGATCTATGATTGCAATACTGCTGCA 
CCATGTCAGGCCCTCAGATACACTACTAGATACTAACGTTATGACGACGT 

1201 ACTGATGCCACCCGCTGGCAAATATGGGATAATGGAACCATCATAAATCC 
TGACTACGGTGGGCGACCGTTTATACCCTATTACCTTGGTAGTATTTAGG 

1251 CAGATCTAGTCTAGTTTTAGCAGCGACATCAGGGAACAGTGGTACCACAC 
GTCTAGATCAGATCAAAATCGTCGCTGTAGTCCCTTGTCACCATGGTGTG 

13 01 TTACAGTGCAAACCAACATTTATGCCGTTAGTCAAGGTTGGCTTCCTACT 
AATGTCACX3TTTGGTTGTAAATACGGCAATCAGTTCCAACCGAAGGATGA 

1351 AATAATACACAACCTTTTGTTACAACCATTGTTGGGCTATATGGTCTGTG 
TTATTATGTGTTGGAAAACAATGTTGGTAACAACCCGATATACCAGACAC 

1401 CITGCAAGCAAATAGTGGACAAGTATGGATAGAGGACTGTAGCAGTGAAA 
GAACGTTCGTTTATCACCTGTTCATACCTATCTCCTGACTTCGTCACTTT 



FIGURE 12B fP3) 



a 

Si 



1451 AGGCTGAACAACAGTGGGCTCTTTATGCAGATGGTTCAATACGTCCTCAG 
TCCGACTTGTTGTCACCCGAGAAATACGTCTACCAAGTTATGCAGGAGTC 

1501 CAAAACCGAGATAATTGCCTTACAAGTGATTCTAATATACGGGAAACAGT 
GTTTTGGCTCTATTAACGGAATGTTCACTAAGATTATATGCCCTTTGTCA 

1551 TGTTAAGATCCTCTCTTGTGGCCCTGCATCCTCTGGCCAACGATGGATGT 
ACAATTCTAGGAGAGAACACCGGGACGTAGGAGACCGGTTGCTACCTACA 

1601 TCAAGAATGATGGAACCATTTTAAATTTGTATAGTGGGTTGGTGTTAGAT 
AGTTCTTACTACCITGGTAAAATTTAAACATATCACCCAACCAa^TCT^ 

1651 GTGAGGCGATCGGATCCGAGCCTTAAACAAATCATTCTTTACCCTCTCCA 
CACTCCGCTAGCCTAGGCTCGGAATTTGTTTAGTAAGAAATGGGAGAGGT 

1701 TGGTGACCCAAACCAAATATGGTTACCATTATTTTGATAGACAGATTACT 
ACCACTGGGTTTGGTTTATACCAATGGTAATAAAACTATCTGTCTAATGA 

1751 CTCTTGCAGTGTGTGTGTCCTGCCATGAAAATAGATGGCTTAAATAAAAA 
GAGAACGTCACACACACAGGACGGTACTTTTATCTACCGAATTTATTTTT 

1801 GGACATTGTAAATTTTGTAACTGAAAGGACAGCAAGTTATATCGAATTCC 
CCTGTAACATTTAAAACATTGACTTTCCTGTCGTTCAATATAGCTTAAGG 

1851 TGCAG 
ACGTC 



Total number of bases is: 18S5. 
Sequence name: pAP317 

Note: Nucleotides in bold are found within the mutant preproricin linker region. The symbol 
within the linker region designate deleted nucleotides. 
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FIGURE 13B rPl> 



Sequence of p AP3 1 8 insert 

10 20 30 40 50 

I I I I I 

1 GAATTCATGAAACCGGGAGGAAATACTATTGTAATATGGATGTATGCAGT 
CTTAAGTACTTTGGCCCTCCTTTATGATAACATTATACCTACATACGTCA 

5 1 GGCAACATGGCTTTGTTTTGGATCCACCTCAGGGTGGTCTTTCACATTAG 
CCGTTGTACCGAAACAAAACCTAGGTGGAGTCCCACCAGAAAGTGTAATC 

101 AGGATAACAACATATTCCCCAAACAATACCCAATTATAAACTTTACCACA 
TCCTATTGTTGTATAAGGGGTTTGTTATGGGTTAATATTTGAAATGGTGT 

151 GCGGGTGCCACTGTGCAAAGCTACACAAACTTTATCAGAGCTGTTCGCGG 
CGCCCACGGTGACACGTTTCGATGTGTTTGAAATAGTCTCGACAAGCGCC 

201 TCGTTTAACAACTGGAGCTGATGTGAGACATGAAATACCAGTGTTGCCAA 
AGCAAATTGTTGACCTCGACTACACTCTGTACTTTATGGTCACAA 

251 ACAGAGTTGGTTTGCCTATAAACCAACGGTTTATTTTAGTTGAACTCTCA 
TGTCTCAACCA;^CGGATATTTGGTTGCaVAATAAAATCAACTTGAGAGT 

3 01 AATCATGCAGAGCTTTCTGTTAO^TTAGCGCTGGATGTCACCAATGCATA 
TTAGTACGTCTCGAAAGACAATGTAATCGCGACCTACAGTGGTTACGTAT 

351 TGTGGTCGGCTACCGTGCTGGAAATAGCGCATATTTCTTTCATCCTGACA 
ACACCAGCCGATGGCACGACCTTTATCGCGTATAAAGAAAGTAGGACTGT 

401 ATCAGGAAGATGCAGAAGCAATCACTCy^TCTTTTCACTGATGTTC^^ 

TAGTCCTTCTACGTCTTCGTTAGTGAGTAGAAAAGTGACTACAAGTTTTA 

451 CGATATACATTCGCCTTTGGTGGTAATTATGATAGACTTGAACAACTTGC 
GCTATATGTAAGCGGAAACCACCATTAATACTATCTGAACTTGTTGAACG 

501 TGGTAATCTGAGAGAAAATATCGAGTTGGGAAATGGTCCACTAGAGGAGG 
ACCATTAGACTCTCTTTTATAGCTCAACCCTTTACCAGGTGATCTCCTCC 

551 CTATCTCAGCGCTTTATTATTACAGTACTGGT6GCACTCAGCTTCCAACT 
GATAGAGTCGCGAAATAATAATGtCATGACCACCGTGAGTCGAAGGTTGA 

601 CTGGCTCGTTCCTTTATAATTTGCATCCAAATGATTTCAGAAGCAGCAAG 
GACCGAGCAAGGAAATATTAAACGTAGGTTTACTAAAGTCTTCGTCGTTC 

651 ATTCCAATATATTGAGGGAGAAATGCX3CACGAGAATTAGGTACAACCGGA 
TAAGGTTATATAACTCCCTCTTTACGCXyrGCTCTTAATCCATGTTGGCCT 



FIGURE 13B (P2> 

701 GATCTGCACCAGATCCTAGCGTAATTACACTTGAGAATAGTTGGGGGAGA 
CTAGACGTGGTCTAGGATCGCATTAATGTGAACTCTTATCAACCCCCTCT 

751 CTTTCCT^CTGCAATTCAAGAGTCTAACCAAGGAGCCTTTGCTAGTCC^ 
GAAAGGTGACGTTAAGTTCTCAGATTGGTTCCTCGGAAACGATCAGGTTA 

801 TCAACTGCAAAGACGTAATGGTTCCAAATTCAGTGTGTACGATGTGAGTA 
AGTTGACGTTTCTGCATTACCAAGGTTTAAGTCACACATGCTACACTCAT 

851 TATTAATCCCTATCTlTAGCrCTCATGGTGTATAGATGCGCACCTCCACCA 
ATAATTAGGGATAGTATCGAGAGTACCACATATCTACGCGTGGAGGTGGT 

901 TCGTCGGAGGTTCTCCGCAAGGAATTGCA660CA6GATGAAGA6GAAT6C 
AGCAGCCTCCAAGAGGCGTTCCTTAACGTCCCOTCCTACTTCTCCTTACG 

951 TGATGTTTGTATGGATCCTGAGCCCATAGTGCGTATCGTAGGTCGAAATG 
ACTACAAACATACCTAGGACTCGGGTATCACGCATAGCATCCAGCTTTAC 

1001 GTCTATGTGTTGATGTTAGGGATGGAAGATTCCACAACGGAAACGCAATA 
CAGATACACAACTACAATCCCTACCTTCTAAGGTGTTGCCTTTGCGTTAT 

1051 CAGTTGTGGCCATGCAAGTCTAATACAGATGCAAATCAGCTCTGGACTTT 
GTCAACACCGGTACGTTCAGATTATGTCTACGTTTAGTCGAGACCTGAAA 

1101 GAAAAGAGACAATACTATTCGATCTAATGGAAAGTGTTTAACTACTTACG 
CTTTTCTCTGTTATGATAAGCTAGATTACCTTTCACAAATTGATGAATGC 

1151 GGTACAGTCCGGGAGTCTATGTGATGATCTATGATTGCAATACTGCTGCA 
CCATGTCAGGCCCTCAGATACACTACTAGATACTAACGTTATGACGACGT 

1201 ACTGATGCCACCCGCTGGCAAATATGGGATAATGGAACCATCATAAATCC 
TGACTACGGTGGGCGACCGTTTATACCCTATTACCTTGGTAGTATTTAGG 

1251 CAGATCTAGTCTAGTTTTAGCAGCGACATCAGGGAACAGTGGTACCACAC 
GTCTAGATCAGATCAAAATCGTCGCTGTAGTCCCTTGTCACCATGGTGTG 

1301 TTACAGTGCAAACCAACATTTATGCCGTTAGTCAAGGTTGGCTTCCTACT 
AATGTCACGTTTGGTTGTAAATACGGCAATCAGTTCCAACCGAAGGATGA 

1351 AATAATACACAACCTTTTGTTACAACCATTGTTGGGCTATATGGTCTGTG 
TTATTATGTGTTGGAAAACAATGTTGGTAACAACCCGATATACCAGACAC 

1401 CTTGCAAGCAAATAGTGGACAAGTATGGATAGAGGACTGTAGCAGTGAAA 
GAACGTTCGTTTATCACCTGTTCATACCTATCTCCTGACATCGTCACTTT 



FIGURE 13B (P3^ 

1451 AGGCTGAACAACAGTGGGCTCTTTATGCAGATGGTTCAATACGTCCTC^ 
TCCGACTTGTTGTCACCCGAGAAATACGTCTACCAAGTTATGCAGGAGTC 

1501 CAAAACCGAGATAATTGCCTTACAAGTGATTCTAATATACGGGAAACAGT 
GTTTTGGCTCTATTAACGGAATGTTCACTAAGATTATATGCCCTTTGTCA 

1551 TGTTAAGATCCTCTCTTGTGGCCCTGCATCCTCTGGCGAACGATGGATGT 
ACAATTCTAGGAGAGAACACCGGGACGTAGGAGACCGGTTGCTACCTACA 

1601 TCAAGAATGATGGAACCATTTTAAATTTGTATAGTGGGTTGGTGTTAGAT 
AGTTCTTACTACCTTGGTAAAATTTAAACATATCACCCAACCACAATCTA 

1651 GTGAGGCGATCGGATCCGAGCCTTAAACAAATCATTCTTTACCCTCTCCA 
CACTCCGCTAGCCTAGGCTCGGAATTTGTTTAGTAAGAAATGGGAGAGGT 

1701 TGGTGACCCAAACCAAATATGGTTACCATTATTTTGATAGACAGATTACT 
ACCACTGGGTTTGGTTTATACCAATGGTAATAAAACTATCTGTCTAATGA 

1751 CTCTTGCAGTGTGTGTGTCCTGCCATGAAAATAGATGGCTTAAATAAAAA 
GAGAACGTCACACT^CACAGGACGGTACTTTTATCTACCGAATTTATTTTT 

1801 GGACATTGTAAATTTTGTAACTGAAAGGACAGCAAGTTATATCGAATTCC 
CCTGTAACATTTAAAACATTGACTTTCCTGTCGTTCAATATAGGTTT^GG 

1851 TGCAG 
ACGTC 



Total number of bases is: 185S. 
Sequence name: pAP3 1 8 

Note: Nucleotides in bold are found within the mutant preproricin linker region. The *-* symbol 
within the linker region designate deleted nucleotides. 
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FIGURE 14B (PI) 

Sequence of pAP319 insert 

10 20 30 40 50 

I I I I I 

1 GAATTCATGAAACCGGGAGGAAATACTATTGTAATATGGATGTATGCAGT 

CTTAAGTACTTTGGCCCTCCTTTATGATAACATTATACCTACATACGTCA 

5 1 GGCAACATGGCTTTGTTTTGGATCCACCTCAGGGTGGTCTTTCACATTAG 
CCGTTGTACCGAAACAAAACCTAGGTGGAGTCCCACCAGAAAGTGTAATC 

101 AGGATAACAACATATTCCCCAAACAATACCCAATTATAAACTTTACCACA 
TCCTATTGTTGTATAAGGGGTTTGTTATGGGTTAATATTTGAAATGGTGT 

151 GCGGGTGCCACTGTGCAAAGCTACACAAACTTTATCAGAGCTGTTCGCGG 
CGCCCACGGTGACACGTTTCGATGTGTTTGAAATAGTCTCGACAAGCGCC 

201 TCGTTTAACAACTGGAGCTGATGTGAGACATGAAATACCAGTGTTGCCAA 
AGCAAATTGTTGACCTCGACTACACTCTGTACTTTATGGTCACAACGGTT 

251 ACAGAGTTGGTTTGCCTATAAACCAACGGTTTATTTTAGTTGAACTCTCA 
TGTCTCAACCAAACGGATATTTGGTTGCCAAATAAAATCAACTTGAGAGT 

301 AATCATGCAGAGCTTTCTGTTACATTAGCGCTGGATGTCACCAATGCATA 
TTAGTACGTCTCGAAAGACAATGTAATCGCGACCTACAGTGGTTACGTAT 

351 TGTGGTCGGCTACCGTGCTGGAAATAGCGCATATTTCTTTCATCCTGACA 
ACACCAGCCGATGGCACGACCTTTATCGCGTATAAAGAAAGTAGGACTGT 

401 ATCAGGAAGATGCAGAAGCAATCACTCATCTTTTCACTGATGTTCAAAAT 
TAGTCCTTCTACGTCTTCGTTAGTGAGTAGAAAAGTGACTACAAGTTTTA 

451 CGATATACATTCGCCTTTGGTGGTAATTATGATAGACTTGAACAACTTGC 
GCTATATGTAAGCGGAAACCACCATTAATACTATCTGAACTTGTTGAACG 

501 TGGTAATCTGAGAGAAAATATCGAGTTGGGAAATGGTCCACTAGAGGAGG 
ACCATTAGACTCTCTTTTATAGCTCAACCCTTTACCAGGTGATCTCCTCC 

551 CTATCTCAGCGCTTTATTATTACAGTACTGGTGGCACTCAGCTTCCAACT 
GATAGAGTCGCGAAATAATAATGTCATGACCACCGTGAGTCGAAGGTTGA 

601 CTGGCTCGTTCCTTTATAATTTGCATCCAAATGATTTCAGAAGCAGCAAG 
GACCGAGCAAGGAAATATTAAACGTAGGTTTACTAAAGTCTTCGTCGTTC 

651 ATTCCAATATATTGAGGGAGAAATGCGCACGAGAATTAGGTACAACCGGA 
TAAGGTTATATAACTCCCTCTTTACGCGTGCTCTTAATCCATGTTGGCCT 



FIGURE 14B CP2\ 



701 GATCTGCACCAGATCCTAGCGTAATTACACTTGAGAATAGTTGGGGGAGA 
CTAGACGTGGTCTAGGATCGCATTAATGTGAACTCTTATCAACCCCCTCT 

751 CTTTCCACTGCAATTCAAGAGTCrAACCAAGGAGCCITTGCTAGTCCAAT 
GAAAGGTGACGTTAAGTTCTCAGATTGGTTCCTCGGAAACGATCAGGTTA 

801 TCAACTGCAAAGACGTAATGGTTCCAAATTCAGTGTGTACGATGTGAGTA 
AGTTGACGTTTCTGCATTACCAAGGTTTAAGTCACACATGCTACACTC^ 

851 TATTAATCCCTATCATAGCTCTCATGGTGTATAGATGCGCACCTCCACCA 
ATAATTAGGGATAGTATCGAGAGTACCACATATCTACGCGTGGAGGTGGT 

901 TCGTCAGGAGGTCAGGTGGTTCAATTGCAGAATTATGATGAAGAGGATGC 
AGCAGTCCTCCAGTCCACCAAGTTAACGTCTTAATACTACTTCTCCTACG 

951 TGATGTTTGTATGGATCCTGAGCCCATAGTGCGTATCGTAGGTCGAAATG 
ACTACAAACATACCTAGGACTCGGGTATCACGCATAGCATCCAGCTTTAC 

1001 GTCTATGTGTTGATGTTAGGGATGGAAGATTCCACAACGGAAACGCAATA 
CAGATACAGAACTACAATCCCTACCTTCTAAGGTGTTGCCTTTGCGTTAT 

1051 CAGTTGTGGCCATGCAAGTCTAATACAGATGCAAATCAGCTCTGGACTTT 
GTCAACACCGGTACGTTCAGATTATGTCTACGTTTAGTCGAGACCTGAAA 

1101 GAAAAGAGACAATACTATTCGATCTAATGGAAAGTGTTTAACTACTTACG 
CTTTTCTCTGTTATGATAAGCTAGATTACCTTTCACAAATTGATGAATGC 

1151 GGTACAGTCCGGGAGTCTATGTGATGATCTATGATTGCAATAGTGCTGCA 
CCATGTCAGGCCCTCAGATACACTACTAGATACTAACGTTATGACX3ACGT 

1201 ACTGATGCCACCCGCTGGCAAATATGGGATAATGGAACCATCATAAATCC 
TGACTACGGTGGGCGACCGTTTATACCCTATTACCITGGTAGTATTTAGG 

1251 CAGATCTAGTCTAGTTTTAGCAGCGACATCAGGGAACAGTGGTACCACAC 
GTCTAGATCAGATCAAAATCGTCGCTGTAGTCCCTTGTCACCATGGTGTG 

1301 TTACAGTGCAAACCAACATTTATGCCGTTAGTCAAGGTTGGCTTCCTACT 
AATGTCACGTTTGGTTGTAAATACGGCTU^TCAGTTCCAACCGAAGGATGA 

1351 AATAATACACAACCTTTTGTTACAACCATTGTTGGGCTATATGGTCTGTG 
TTATTATGTGTTGGAAAACAATGTTGGTAACAACCCGATATACCAGACAC 

1401 CTTGCAAGCAAATAGTGGACAAGTATGGATAGAGGAGTGTAGCAGTGAAA 
GAACGTTCGTTTATCACCTGTTCATACCTATCTCCTCSAQATCGT^^ 



FIGURE 14B rP3^ 

1451 AGGCTGAACAACAGTGGGCTCTTTATGCAGATGGTTCAATACGTCCTCAG 
TCCGACTTGTTGTCACCCGAGT^AATACGTCTACCAAGTTATGCAGGAGTC 

1501 CAAAACCGAGATAATTGCCTTACAAGTGATTCTAATATACGGGAAACAGT 
GTTTTGGCTCTATTAACGGAATGTTCACTAAGATTATATGCCCTTTGTCA 

1551 TGTTAAGATCCTCTCTTGTGGCCCTGCATCCTCTGGCCAACGATGGATGT 
ACAATTCTAGGAGAGAACACCGGGACGTAGGAGACCGGTTGCTACCTACA 

1601 TCAAGAATGATGGAACCATTTTAAATTTGTATAGTGGGTTGGTGTTAGAT 
AGTTCTTACTACCTTGGTAAAATTTAAACATATCACCCAACCACAATCTA 

1651 GTGAGGCGATCGGATCCGAGCCTTAAACAAATCATTCTTTACCCTCTCCA 
CACTCCGCTAGCCTAGGCTCGGAATTTGTTTAGTAAGAAATGGGAGAGGT 

1701 TGGTGACCCAAACCAAATATGGTTACCATTATTTTGATAGACAGATTACT 
ACCACTGGGTTTGGTTTATACCAATGGTAATAAAACTATCTGTCTAATGA 

1751 CTCTTGCAGTGTGTGTGTCCTGCCATGAAAATAGATGGCTTAAATAAAAA 
GAGAACGTCACACACACAGGACGGTACTTTTATCTACCGAATTTATTTTT 

1801 GGACATTGTAAATTTTGTAACTGAAAGGACAGCAAGTTATATCGAATTCC 
CCTGTAACATTTAAAACATTGACTTTCCTGTCGTTCAATATAGCTTAAGG 

1851 TGCAG 
ACGTC 



Total number of bases is: 1 855. 
Sequence name: pAP3 1 9 

Note: Nucleotides in bold are found within the mutant preproricin linker region. The symbol 
within the linker region designate deleted nxicleotides. 
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FIGURE 15B (Pl^ 

Sequence of pAP320 insert 

10 20 30 40 50 

I I I I I 

1 GAATTCATGAAACCGGGAGGAAATACTATTGTAATATGGATGTATGCAGT 
CTTAAGTACTTTGGCCCTCCTTTATGATAACATTATACCTACATACGTCA 

5 1 GGCAACATGGCTTTGTTTTGGATCCACCTCAGGGTGGTCTTTCACATTAG 
CCGTTGTACCGAAACAAJ\ACCTAGGTGGAGTCCCACCAGAAAGTGTAATC 

101 AGGATAACAACATATTCCCCAAACAATACCCAATTATAAACTTTACCACA 
TCCTATTGTTGTATAAGGGGTTTGTTATGGGTTAATATTTGAAATGGTGT 

151 GCGGGTGCCACTGTGCAAAGCTACACAAACTTTATCAGAGCTGTTCGCGG 
CGCCCACGGTGACACGTTTCX5ATGTGTTTGAAATAGTCTCGACAAGCGCC 

201 TCGTTTAACAACTGGAGCTGATGTGAGACATGAAATACCAGTGTTGCCAA 
AGCAAATTGTTGACCTCGACTACACTCTGTACTTTATGGTCACAACGGTT 

251 ACAGAGTTGGTTTGCCTATAAACCTVACGGTTTATTTTAGTTGAACTCTCA 
TGTCTCAACCAAACGGATATTTGGTTGCCAAATAAAATC7ACTTGAGAGT 

301 AATCATGCaiGAGGTTTCTGTTACATTAGCXSCTGGATGTCACCAAT^ 

TTAGTACGTCTCGAAAGACAATGTAATCGCGACCTACAGTGGTTACGTAT 

351 TGTGGTCGGCTACCGTGCTGGAAATAGCGCATATTTCTTTCATCCTGACA 
ACACCAGCCGATGGCACGACCTTTATCGCGTATAAAGAAAGTAGGACTGT 

401 ATCAGGAAGATGCAGAAGCAATCACTCATCTTTTCACTGATGTTCAAAAT 
TAGTCCTTCTACGTCTTCGTTAGTGAGTAGAAAAGTGACTACAAGTTTTA 

451 CGATATACATTCGCCTTTGGTGGTAATTATGATAGACTTGAACAACTTGC 
GCTATATGTAAGCGGAAACCACCATTAATACTATCTGAACTTGTTGAACG 

501 TGGTAATCTGAGAGAAAATATCGAGTTGGGAAATGGTCCACTAGAGGAGG 
ACCATTAGACTCTCTTTTATAGCTCAACCCTTTACCAGGTGATCTCCTCC 

551 CTATCTCAGCGCTTTATTATTACAGTACTGGTGGCACTCAGCTTCCAACT 
GATAGAGTCGCGAAATAATAATGTCATGACCACCGTGAGTCGAAGGTTGA 

601 CTGGCTCGTTCCTTTATAATTTGCATCCAAATGATTTCAGAAGCAGCAAG 
GACCGAGCAAGGT^AATATTAAACGTAGGTTTACTAAAGTCTTCGTCGTTC 

651 ATTCCAATATATTGAGGGAGAAATGCGCACGAGAATTAGGSK^tSAA^ 

TAAGGTTATATAACTCCCTCTTTACGCGTGCTCTTAATCCATGTTGGCCT 



• • # 



FIGURE 15B (P2> 

701 GATCTGCACCAGATCCTAGCGTAATTACACTTGAGAATAGTTGGGGGAGA 
CTAGACGTGGTCTAGGATCGCATTAATGTGAACTCTTATCAACCCCCTCT 

751 CTTTCCACTGCAATTCAAGAGTCTAACCAAGGAGCCTTTGCTAGTCaA 
GAAAGGTGACGTTAAGTTCTCAGATTGGTTCCTCGGAAACGATCAGGTTA 

801 TCAACTGCAAAGACGTAATGGTTCCAAATTCAGTGTGTACGATGTGAGTA 
AGTTGACGTTTCTGCATTACCAAGGTTTAAGTCACACATGCTACACTCAT 

851 TATT7ATCCCTATCATAGCTCTCATGGTGTATAGATGCGGAGGCGGA 

ATAATTAGGGATAGTATCGAGAGTACCACATATCTACGCCTCCGCCT 

901 CCAGGACGA6TAGTCGGCG6G GG 

GGTCCTGCTCATCAGCCGCCC CC 

951 GGGAGGCTGTATGGATCCTGAGCCCATAGTGCGTATCGTAGGTCGAAATG 
CCCTCCGACATACCTAGGACTCGGGTATCACGCATAGCATCCAGCTTTAC 

1001 GTCTATGTGTTGATGTTAGGGATGGAAGATTCCACAACGGAAACGCAATA 
CAGATACACAACTACAATCCCTACCTTCTAAGGTGTTGCCTTTGCGTTAT 

1051 CAGTTGTGGCCATGCAAGTCTAATACAGATGCAAATCAGCTCTGGACTTT 
GTCAACACCGGTACGTTCAGATTATGTCTACGTTTAGTCGAGACCTGAAA 

1101 GAAAAGAGACAATACTATTCGATCTAATGGAAAGTGTTTAACTACTTACG 
CTTTTCTCTGTTATGATAAGCTAGATTACCTTTCACAAATTGATGAATGC 

1151 GGTACAGTCCGGGAGTCTATGTGATGATCTATGATTGCAATACTGCTGCA 
CCATGTCAGGCCCTCAGATACACTACTAGATACTAACGTTATGACGACGT 

1201 ACTGATGCCACCCGCTGGCAAATATGGGATAATGGAACCATCATAAATCC 
TGACTACGGTGGGCGACCGTTTATACCCTATTACCTTGGTAGTATTTAGG 

1251 CAGATCTAGTCTAGTTTTAGCAGCGACATCAGGGAACAGTGGTACCACAC 
GTCTAGATCAGATCAAAATCGTCGCTGTAGTCCCTTGTCACCATGGTGTG 

1301 TTACAGTGCAAACCAACATTTATGCCGTTAGTCAAGGTTGGCTTCCTACT 
AATGTCACGTTTGGTTGTAT^TACGGCAATCAGTTCCAACCGAAGGATGA 

1351 AATAATACACAACCTTTTGTTACAACCATTGTTGGGCTATATGGTCTGTG 
TTATTATGTGTTGGAAAACAATGTTGGTAACAACCCGATATACCAGACAC 

1401 CTTGCAAGCAAATAGTGGACAAGTATGGATAGAGGACTGTAGCAGTGAAA 
GAACGTTCGTTTATCACCTGTTCATACCTATCTCCTGACATCGTCACTTT 



FIGURE 15B 



1451 AGGCTGAACAACAGTGGGCTCTTTATGCAGATGGTTCAATACGTCCTCAG 
TCCGACTTGTTGTCACCCGAGAAATACGTCTACCAAGTTATGCAGGAGTC 

1501 CAAAACCGAGATAATTGCCTTACAAGTGATTCTAATATACGGGAAACAGT 
GTTTTGGCTCTATTAACGGAATGTTCACTAAGATTATATGCCCTTTGTCA 

1551 TGTTAAGATCCTCTCTTGTGGCCCTGCATCCTCTGGCCAACGATGGATGT 
ACAATTCTAGGAGAGAACACCGGGACGTAGGAGACCGGTTGCTACCTACA 

1601 TCAAGAATGATGGAACCATTTTAAATTTGTATAGTGGGTTGGTGTTAGAT 
AGTTCTTACTACCTTGGTAAAATTTAAACATATCACCCAACCACAATCTA 

1651 GTGAGGCGATCGGATCCGAGCCTTAAACAAATCATTCTTTACCCTCTCCA 
CACTCCGCTAGCCTAGGCTCGGAATTTGTTTAGTAAGAAATGGGAGAGGT 

1701 TGGTGACCCAAACCAAATATGGTTACCATTATTTTGATAGACAGATTACT 
ACCACTGGGTTTGGTTTATACCAATGGTAATAAAACTATCTGTCTT^TGA 

1751 CTCTTGCAGTGTGTGTGTCCTGCCATGAAAATAGATGGCTTAAATAAAAA 
GAGAACGTCACACACACAGGACGGTACTTTTATCTACCGAATTTATTTTT 

1801 GGACATTGTAAATTTTGTAACTGAAAGGACAGCAAGTTATATCGAATTCC 
CCTGTAACATTTAAAACATTGACTTTCCTGTCGTTCT^TATAGCTTAAGG 

1851 TGCAG 
AGGTC 



Total nxunber of bases is: 182S. 
Seqiience name: pAP320 

Note: Nucleotides in bold are found within the mutant preproricin linker region. The symbol 
within the linker region designate deleted nucleotides. 
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FIGURE 16B (PI) 



Sequence of pAP321 insert 

10 20 30 40 50 

I I I I I 

1 GAATTCATGAAACCGGGAGGAAATACTATTGTAATATGGATGTATGCAGT 

CTTAAGTACTTTGGCCCTCCTTTATGATAACATTATACCTACATACGTCA 

5 1 GGCAACATGGCTTTGTTTTGGATCCACCTCAGGGTGGTCTTTCACATTAG 
CCGTTGTACCGAAACAAAACCTAGGTGGAGTCCCACCAGAAAGTGTAATC 

101 AGGATAACAACATATTCCCCAAACAATACCCAATTATAAACTTTACCACA 
TCCTATTGTTGTATAAGGGGTTTGTTATGGGTTAATATTTGAAATGGTGT 

151 GCGGGTGCCACTGTGCAAAGCTACACAAACTTTATCAGAGCTGTTCGCGG 
CGCCCACGGTGACACGTTTCGATGTGTTTGAAATAGTCTCGACAAGCGCC 

201 TCGTTTAACAACTGGAGCTGATGTGAGACATGAAATACCAGTGTTGCCAA 
AGCAAATTGTTGACCTCGACTACACTCTGTACTTTATGGTCACAACGGTT 

251 ACAGAGTTGGTTTGCCTATAAACCAACGGTTTATTTTAGTTGAACTCTCA 
TGTCTCAACCAAACGGATATTTGGTTGCCAAATAAAATCAACTTGAGAGT 

301 AATCATGCAGAGCTTTCTGTTACATTAGCGCTGGATGTCACCAATGCATA 
TTAGTACGTCTCGAAAGACAATGTAATCGCGACCTACAGTGGTTACGTAT 

351 TGTGGTCGGCTACCGTGCTGGAAATAGCGCATATTTCTTTCATCCTGACA 
ACACCAGCCGATGGCACGACCTTTATCGCGTATAAAGAAAGTAGGACTGT 

401 ATCAGGAAGATGCAGAAGCAATCACTCATCTTTTCACTGATGTTCAAAAT 
TAGTCCTTCTACGTCTTCGTTAGTGAGTAGAAAAGTGACTACAAGTTTTA 

451 CGATATACATTCGCCTTTGGTGGTAATTATGATAGACTTGAACAACTTGC 
GCTATATGTAAGCGGAAACCACCATTAATACTATCTGAACTTGTTGAA.CG 

501 TGGTAATCTGAGAGAAAATATCGAGTTGGGAAATGGTCCACTAGAGGAGG 
ACO^TTAGACTCTCTTTTATAGCTCAACCCTTTACCAGGTGATCTCCTCC 

551 CTATCTCAGCGCTTTATTATTACAGTACTGGTGGCACTCAGCTTCCAACT 
GATAGAGTCGCGAAATAATAATGTCATGACCACCGTGAGTCGAAGGTTGA 

601 CTGGCTCGTTCCTTTATAATTTGCATCCAAATGATTTCAGAAGCAGCAAG 
GACCGAGCAAGGAAATATTATIACGTAGGTTTACTAAAGTCTTCGTCGTTC 

651 ATTCCAATATATTGAGGGAGAAATGCGCACGAGAATTAGGTACAACCGGA 
TAAGGTTATATAACTCCCTCTTTACGCGTGCTCTTAATCCATGTTGGCCT 



FIGURE 16B (P2) 

701 GATCTGCACCAGATCCTAGCGTAATTACACTTGAGAATAGTTGGGGGAGA 
CTAGACGTGGTCTAGGATCGCATTAATGTGAACTCTTATCAACCCCCTCT 

751 CTTTCCACTGCAATTCAAGAGTCTAACCAAGGAGCCTTTGCTAGTCCAAT 
GAAAGGTGACGTTAAGTTCTCAGATTGGTTCCTCGGAAACGATCAGGTTA 

801 TCAACTGCAAAGACGTAATGGTTCCAAATTCAGTGTGTACGATGTGAGTA 
AGTTGACGTTTCTGCATTACCAAGGTTTAAGTCACACATGCTACACTCAT 

851 TATTAATCCCTATCATAGCTCTCATGGTGTATAGATGCGGAGGC 

ATAATTAGGGATAGTATCGAGAGTACCACATATCTACGCCTCCG 

901 CCAGGACGAGTAGTC6GCGGG 

GGTCCTGCTCATCA6CCGCCC 

951 - GGAGGCTGTATGGATCCTGAGCCCATAGTGCGTATCGTAGGTCGAAATG 
-CCTCCGACATACCTAGGACTCGGGTATCACGCATAGCATCCAGCTTTAC 

1001 GTCTATGTGTTGATGTTAGGGATGGAAGATTCCACAACGGAAACGCAATA 
CAGATACACAACTACAATCCCTACCTTCTAAGGTGTTGCCTTTGCGTTAT 

1051 CAGTTGTGGCCATGCAAGTCTAATACAGATGCAAATCAGCTCTGGACTTT 
GTCAACACCGGTACGTTCAGATTATGTCTACGTTTAGTCGAGACCTGAAA 

1101 GAAAAGAGACAATACTATTCGATCTAATGGAAAGTGTTTAACTACTTACG 
CTTTTCTCTGTTATGATAAGCTAGATTACCTTTCACAAATTGATGAATGC 

1151 GGTACAGTCCGGGAGTCTATGTGATGATCTATGATTGCAATACTGCTGCA 
CCATGTCAGGCCCTCAGATACACTACTAGATACTAACGTTATGACGACGT 

1201 ACTGATGCCACCCGCTGGCAAATATGGGATAATGGAACCATCATAAATCC 
TGACTACGGTGGGCGACCGTTTATACCCTATTACCTTGGTAGTATTTAGG 

1251 CAGATCTAGTCTAGTTTTAGCAGCGACATCAGGGAACAGTGGTACCACAC 
GTCTAGATCAGATCAAAATCGTCGCTGTAGTCCCTTGTCACCATGGTGTG 

13 01 TTACAGTGCAAACCAACATTTATGCCGTTAGTCAAGGTTGGCTTCCTACT 
AATGTCACGTTTGGTTGTA7VATACGGCAATCAGTTCCAACCGAAGGATGA 

1351 AATAATACACAACCTTTTGTTACAACCATTGTTGGGCTATATGGTCTGTG 
TTATTATGTGTTGGAAAACAATGTTGGTAACAACCCGATATACCAGACAC 

1401 CTTGCAAGCAAATAGTGGACAAGTATGGATAGAGGACTGTAGCAGTGAAA 
GAAGGTTeGTTTATCACGTGTTCATACCraTCrrCCTGAGATCGTCA 



FIGURE 16B fP3) 



1451 AGGCTGAACT^a^GTGGGCTCTTTATGCAGATGGTTCAATACGTCCT 

TCCGACTTGTTGTCACCCGAGAAATACGTCTACCAAGTTATGCAGGAGTC 

1501 CAAT^CCGAGATAATTGCCTTACAAGTGATTCTAATATACGGGAAACAGT 
GTTTTGGCTCTATTAACGGAATGTTCACTAAGATTATATGCCCTTTGTCA 

1551 TGTTAAGATCCTCTCTTGTGGCCCTGCATCCTCTGGCCAACGATGGATGT 
ACAATTCTAGGAGAGAACACCGGGACGTAGGAGACCGGTTGCTACCTACA 

1601 TCAAGAATGATGGAACCATTTTAAATTTGTATAGTGGGTTGGTGTTAGAT 
AGTTCTTACTACCTTGGTAAAATTTAAACATATCACCCAACCACAATCTA 

1651 GTGAGGCGATCGGATCCGAGCCTTAAACAAATCATTCTTTACCCTCTCCA 
CACTCCGCTAGCCTAGGCTCGGAATTTGTTTAGTAAGAAATGGGAGAGGT 

1701 TGGTGACCCAAACCAAATATGGTTACCATTATTTTGATAGACAGATTACT 
ACCACTGGGTTTGGTTTATACCAATGGTAATAAAACTATCTGTCTAATGA 

1751 CTCTTGCAGTGTGTGTGTCCTGCCATGAAAATAGATGGCTTAAATAAAAA 
GAGAACGTCACACACA(^GGACGGTACITTTATCTACCGAATTTATTTTT 

1801 GGACATTGTAAATTTTGTAACTGAAAGGACAGCAAGTTATATCGAATTCC 
CCTGTAACATTTAAAACATTGACTTTCCTGTCGTTCAATATAGCTTAAGG 

1851 TGCAG 
ACGTC 



Total niimber of bases is: 1819. 
Sequence name: pAP321 

Note: Nucleotides in bold are found within the mutant preproricin linker region. The symbol 
within the linker region designate deleted nucleotides. 
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FIGURE 17B (Pl^ 

Sequence of pAP322 insert 

10 20 30 40 50 

I I I I I 

1 GAATTCATGAAACCGGGAGGAAATACTATTGTAATATGGATGTATGCAGT 
CTTAAGTACTTTGGCCCTCCTTTATGATAACATTATACCTACATACGTCA 

5 1 GGCAACATGGCTTTGTTTTGGATCCACCTCAGGGTGGTCTTTCACATTAG 
CCGTTGTACCGAAACAAAACCTAGGTGGAGTCCCACCAGAAAGTGTAATC 

101 AGGATAACAACATATTCCCC?U^CAATACCCAATTATAAACTTTACC^ 

TCCTATTGTTGTATAAGGGGTTTGTTATGGGTTAATATTTGAAATGGTGT 

151 GCGGGTGCCACTGTGCAAAGCTACACAAACTTTATCAQAGCTGTTCGC^ 
CGCCCACGGTGACACGTTTCGATGTGTTTGAAATAGTCTCGACAAGCGCC 

201 TCGTTTAACAACTGGAGCTGATGTGAGACATGAAATACCAGTGTTGCCAA 
AGCAAATTGTTGACCTCGACTACACTCTGTACTTTATGGTCACAACGGTT 

251 ACAGAGTTGGTTTGCCTATAAACCAACGGTTTATTTTAGTTGAACTCTCA 
TGTCTCAACCAAACGGATATTTGGTTGCCTU^TAAAATCAACTTGAGA 

301 AATCa.TGCAGAGerTTCrGTTACATTAGCGCTGGATGTCACCAATGCATA 
TTAGTACGTCTCGAAAGACAATGTAATCGCGACCTACAGTGGTTACGTAT 

351 TGTGGTCGGCTACCGTGCTGGAAATAGCGCATATTTCTTTCATCCTGACA 
ACACCAGCCGATGGCACGACCTTTATCGCGTATAAAGAAAGTAGGACTGT 

401 ATCAGGAAGATGCAGAAGCAATCACTCATCTTTTCACTGATGTTCAAAAT 
TAGTCCTTCTACGTCTTCGTTAGTGAGTAGAAAAGTGACTACAAGTTTTA 

451 CGATATACATTCGCCTTTGGTGGTAATTATGATAGACTTGAACAACTTGC 
GCTATATGTAAGCGGAAACCACCATTAATACTATCTGAACTTGTTGAACG 

501 TGGTAATCTGAGAGAAAATATCGAGTTGGGAAATGGTCCACTAGAGGAGG 
ACCATTAGACTCTCTTTTATAGCTCAACCCTTTACCAGGTGATCTCCTCC 

551 CTAT.CTCAGCGCTTTATTATTACAGTACTGGTGGCACTCAGCTTCCAACT 
GATAGAGTCGCGAAATAATAATGTCATGACCACCGTGAGTCGAAGGTTGA 

601 CTGGCTCGTTCCTTTATAATTTGCATCCAAATGATTTCAGAAGCAGCAAG 
GACCGAGCAAGGAAATATTAAACGTAGGTTTACTAAAGTCTTCGTCGTTC 

651 ATTCCAATATATTGAGGGAGAAATGCGCACGAGAATTAGGTACAACCGGA 
TAAGGTTATATAACTCCCTCTTTACGCGTGCTCTTAATCCATGTTGGCCT 



FIGURE 17B (P2^ 



701 GATCTGCACCAGATCCTAGCGTAATTACACTTGAGAATAGTTGGGGGAGA 
CTAGACGTGGTCTAGGATCGCATTAATGTGAACTCTTATCAACCCCCTCT 

751 CTTTCCACTGCAATTCAAGAGTCTAACCAAGGAGCCTTTGCTAGTCCAAT 
GAAAGGTGACGTTAAGTTCTCAGATTGGTTCCTCGGAAACGATCAGGTTA 

801 TCAACTGCAAAGACGTAATGGTTCCAAATTCAGTGTGTACGATGTGAGTA 
AGTTGACGTTTCTGCATTACCAAGGTTTAAGTCACACATGCTACACTCAT 

851 TATTAATCCCTATCATAGCTCTCATGGTGTATAGATGCGGA 

ATAATTAGGGATAGTATCGAGAGTACCACATATCTACGCCT 

901 CCAGGACGAGTAGTCGGCGGG- 

GGTCCTGCTCATCAGCCGCCC 

951 GGCTGTATGGATCCTGAGCCCATAGTGCGTATCGTAGGTCGAAATG 

CCGACATACCTAGGACTCGGGTATCTICGCATAGCATCCAGCTTTAC 

1001 GTCTATGTGTTGATGTTAGGGATGGAAGATTCCACAACGGAAACGCAATA 
CAGATACACAACTACAATCCCTACCTTCTAAGGTGTTGCCTTTGCGTTAT 

1051 CAGTTGTGGCCATGCAAGTCTT^TACAGATGCAAATCAGCTCTGGACTTT 
GTCAACACCGGTACGTTCAGATTATGTCTACGTTTAGTCGAGACCTGAAA 

1101 GAAAAGAGACT^TACTATTCGATCTAATGGAAAGTGTTTAACTACTTACG 
CTTTTCTCTGTTATGATAAGCTAGATTACCTTTCACAAATTGATGAATGC 

1151 GGTACAGTCCGGGAGTCTATGTGATGATCTATGATTGCAATACTGCTGCA 
CCATGTCAGGCCCTCAGATACACTACTAGATACTAACGTTATGACGACGT 

1201 ACTGATGCCAC CCGCTGGCAAATATGGGATAATGGAACCATCATAAATCC 
TGACTACGGTGGGCGACCGTTTATACCCTATTACCTTGGTAGTATTTAGG 

1251 CAGATCTAGTCTAGTTTTAGCAGCGACATCAGGGAACAGTGGTACCACAC 
GTCTAGATCAGATCAAAATCGTCGCTGTAGTCCCTTGTCACCATGGTGTG 

13 01 TTACAGTGCAAACCAACATTTATGCCGTTAGTCAAGGTTGGCTTCCTACT 
AATGTCACGTTTGGTTGTAAATACGGCAATCAGTTCCAACCGAAGGATGA 

1351 AATAATACACAACCTTTTGTTACAACCATTGTTGGGCTATATGGTCTGTG 
TTATTATGTGTTGGAAAACAATGTTGGTAACAACCCGATATACCAGACAC 

1401 CTTGCAAGCAAATAGTGGACAAGTATGGATAGAGGACTGTAGCAGTGAAA 
GAACX5TTCGTTTATCACCTGTTCATACCTATCTCCTGACATCGTCACTTT 



FIGURE 17B (P3> 



1451 AGGCTGAACAACAGTGGGCTCTTTATGCAGATGGTTCAATACGTCCTCAG 
TCCGACTTGTTGTCACCCGAGAAATACGTCTACCAAGTTATGCAGGAGTC 

1501 CAAAACCGAGATAATTGCCTTACAAGTGATTCTAATATACGGGAAACAGT 
GTTTTGGCTCTATTAACGGAATGTTCACTAAGATTATATGCCCTTTGTCA 

1551 TGTTAAGATCCTCTCTTGTGGCCCTGCATCCTCTGGCCAACGATGGATGT 
ACAATTCTAGGAGAGAACACCGGGACGTAGGAGACCGGTTGCTACCTACA 

1601 TCAAGAATGATGGAACCATTTTAAATTTGTATAGTGGGTTGGTGTTAGAT 
AGTTCTTACTACCTTGGTAAAATTTAAACATATCACCCAACCACAATCTA 

1651 GTGAGGCGATCGGATCCGAGCCTTAAACAAATCATTCTTTACCCTCTCCA 
CACTCCGCTAGCCTAGGCTCGGAATTTGTTTAGTAAGAAATGGGAGAGGT 

1701 TGGTGACCCAAACCAAATATGGTTACCATTATTTTGATAGACAGATTACT 
ACCACTGGGTTTGGTTTATACCAATGGTAATAAAACTATCTGTCTAATGA 

1751 CTCTTGCAGTGTGTGTGTCCTGCCATGAAAATAGATGGCTTAAATAAAAA 
GAGAACGTCACACACACAGGACGGTACTTTTATerACCGAATTTATTTTT 

18 01 GGACATTGTAAATTTTGTAACTGAAAGGACAGCAAGTTATATCGAATTCC 
CCTGTAACT^TTTAAAACATTGACTTTCCTGTCGTTCAATATAGCTTAAGG 

1851 TGCAG 
ACGTC 



Total number of bases is: 1813. 
Sequence name: pAP322 

Note: Nucleotides in bold are found within the mutant preproricin linker region. The symbol 
within the linker region designate deleted nucleotides. 
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FIGURE 18B (PI) 

Sequence of pAP323 insert 

10 20 30 40 50 

I I I i I 

1 GAATTCATGAAACCGGGAGGAAATACTATTGTAATATGGATGTATGCAGT 
CTTAAGTACTTTGGCCCTCCTTTATGATAACATTATACCTACATACGTCA 

5 1 GGCAACATGGCTTTGTTTTGGATCCACCTCAGGGTGGTCTTTCACATTAG 
CCGTTGTACCGAAACAAAACCTAGGTGGAGTCCCACCAGAAAGTGTAATC 

101 AGGATAACT^CATATTCCCCAAACAATACCCAATTATAAACTTTACCACA 
TCCTATTGTTGTATAAGGGGTTTGTTATGGGTTAATATTTGAAATGGTGT 

151 GCGGGTGCCACTGTGCAAAGCTACACAAACTTTATCAGAGCTGTTCGCGG 
CGCCCACGGTGACACGTTTCGATGTGTTTGAAATAGTCTCGACAAGCGCC 

201 TCGTTTAACAACTGGAGCTGATGTGAGACATGAAATACCAGTGTTGCCAA 
AGCAAATTGTTGACCTCGACTACACTCTGTACnTTATGGTCACAACGGT^ 

251 ACAGAGTTGGTTTGCCTATAAACCAACGGTTTATTTTAGTTGAACTCTCA 
TGTCTCAACCAAACGGATATTTGGTTGCCAAATAAAATCAACTTGAGAGT 

3 01 AATCATGCAGAGCTTTCTGTTACATTAGCGCT6GATGTCACCAATGCATA 
TTAGTACGTCTCGAAAGACAATGTAATCGCGACCTACAGTGGTTACGTAT 

351 TGTGGTCGGCTACCGTGCTGGAAATAGCGCATATTTCTTTCATCCTGACA 
ACACCAGCCGATGGCACGACCTTTATCGCGTATAAAGAAAGTAGGACTGT 

401 ATCAGGAAGATGCAGAAGC7VATCACTCATCTTTTCACTGATGTTCAAAAT 
TAGTCCTTCTACGTCTTCGTTAGTGAGTAGAAAAGTGACTACAAGTTTTA 

451 CGATATACATTCGCCTTTGGTGGTAATTATGATAGACTTGAACAACTTGC 
GCTATATGTAAGCGGAAACCACCATTAATACTATCTGAACTTGTTGAACG 

501 TGGTAATCTGAGAGAAAATATCGAGTTGGGAAATGGTCCACTAGAGGAGG 
ACCATTAGACTCTCTTTTATAGCTCAACCCTTTACCAGGTGATCTCCTCC 

551 CTATCTCAGCGCTTTATTATTACAGTACTGGTGGCACTCAGCTTCCAACT 
GATAGAGTCGCGAAATAATAATGTCATGACCACCGTGAGTCGAAGGTTGA 

601 CTGGCTCGTTCCTTTATAATTTGCATCCAAATGATTTCAGAAGCAGCAAG 
GACCGAGCAAGGAAATATTAAACGTAGGTTTACTAAAGTCTTCGTCGTTC 

651 ATTCCAATATATTGAGGGAGAAATGCGCACGAGAATTAGGTACAACCGGA 
TAAGGTTATATAACTCCCTCTTTACGCGTGCTCTTAATCCATGTTGGCCT 



FIGURE 18B (P2^ 



701 GATCTGCACCAGATCCTAGCGTAATTACACTTGAGAATAGTTGGGGGAGA 
CTAGACGTGGTCTAGGATCGCATTAATGTGAACTCTTATCAACCCCCTCT 

751 CTTTCCACTGCAATTCAAGAGTCTAACCAAGGAGCCTTTGCTAGTCCAAT 
GAAAGGTGACGTTAAGTTCTCAGATTGGTTCCTCGGAAACGATCAGGTTA 

801 TCAACTGCAAAGACGTAATGGTTCCAAATTCAGTGTGTACGATGTGAGTA 
AGTTGACGTTTCTGCATTACCAAGGTTTAAGTCy^CACATGCTACACTCAT 

851 TATTAATCCCTATCATAGCTCTCATGGTGTATAGATGCGGAGGCGGGACT 
ATAATTAGGGATAGTATCGAGAGTACCACATATCTACGCCTCCGCCCTGA 

901 CCAGGG GGTCCGCAAGGAATTGCAGGGCAG GGGGGTAGTAGCGG 

6GTCCC CCAGGCGTTCCTTAACGTCCCGTC CCCCCATCATCGCC 

951 CGGGGGATGTATGGATCCTGAGCCCATAGTGCGTATCGTAGGTCGAAATG 
GCCCCCTACATACCTAGGACTCGGGTATCACGCATAGCATCCAGCTTTAC 

1001 GTCTATGTGTTGATGTTAGGGATGGAAGATTCCACAACGGAAACGCAATA 
CAGATACTVCAACTACAATCCCTACCTTCTAAGGTGTTGCCT^ 

1051 CAGTTGTGGCCATGCAAGTCTAATACAGATGCAAATCAGCTCTGGACTTT 
GTCAACACCGGTACGTTCAGATTATGTCTACGTTTAGTCGAGACCTGAAA 

1101 GAAAAGAGACAATACTATTCGATCTAATGGAAAGTGTTTAACTAGTTACG 
CTTTTCTCTGTTATGATAAGCTAGATTACCTTTCACAAATTGATGAATGC 

1151 GGTACAGTCCGGGAGTCTATGTGATGATCTATGATTGCAATACTGCTGCA 
CCATGTCAGGCCCTCAGATACACTACTAGATACTAACGTTATGACGACGT 

1201 ACTGATGCCACCCGCTGGCAAATATGGGATAATGGAACCATCATAAATCC 
TGACTACGGTGGGCGACCGTTTATACCCTATTACCTTGGTAGTATTTAGG 

1251 CAGATCTAGTCTAGTTTTAGCAGCGACATCAGGGAACAGTGGTACCACAC 
GTCTAGATCAGATCAT^AATCGTCGCTGTAGTCCCTTGTCACCATGGTGTG 

13 01 TTACAGTGCAAACCAACATTTATGCCGTTAGTCAAGGTTGGCTTCCTACT 
AATGTCACGTTTGGTTGTAAATACGGCAATCAGTTCCAACCGAAGGATGA 

13 51 AATAATACACAACCTTTTGTTACAACCATTGTTGGGCTATATGGTCTGTG 
TTATTATGTGTTGGAAAACAATGTTGGTAACAACCCGATATACCAGACAC 

1401 CTTGCAAGCAAATAGTGGACAAGTATGGATAGAGGACTGTAGCAGTGAAA 
GAACGTTCGTTTATCACCTGTTCATACCTATCTCGTGACATCX3TCACTTT 



FIGURE 18B (F3) 

1451 AGGCTGAACAACAGTGGGCTCTTTATGCAGATGGTTCAATACGTCCTCAG 
TCCGACTTGTTGTCACCCGAGAAATACGTCTACCAAGTTATGCAGGAGTC 

1501 CAAAACCGAGATAATTGCCTTACAAGTGATTCTAATATACGGGAAACAGT 
GTTTTGGCTCTATTAACGGAATGTTCACTAAGATTATATGCCCTTTGTCA 

1551 TGTTAAGATCCTCTCTTGTGGCCCTGCATCCTCTGGCCAACGATGGATGT 
ACAATTCTAGGAGAGAACACCGGGACGTAGGAGACCGGTTGCTACCTACA 

1601 TCAAGAATGATGGAACCATTTTAAATTTGTATAGTGGGTTGGTGTTAGAT 
AGTTCTTACTACCTTGGTAAAATTTAAACATATCACCCAACCACAATCTA 

1651 GTGAGGCGATCGGATCCGAGCCTTAAACAAATCATTCTTTACCCTCTCCA 
CACTCCGCTAGCCTAGGCTCGGAATTTGTTTAGTAAGAAATGGGAGAGGT 

1701 TGGTGACCCAAACCAAATATGGTTACCATTATTTTGATAGACAGATTACT 
ACCACTGGGTTTGGTTTATACCAATGGTAATAAAACTATCTGTCTAATGA 

1751 CTCTTGCAGTGTGTGTGTCCTGCCATGAAAATAGATGGCTTAAATAAAAA 
GAGAACGTCACACACACAGGACGGTACTTTTATCTACCGAATTTATTTTT 

1801 GGACATTGTAAATTTTGTAACTGAAAGGACAGCT^GTTATATCGAATTCC 
CCTGTAACATTTAAAACATTGACTTTCCTGTCGTTCAATATAGCTTAAGG 

1851 TGCAG 
ACGTC 



Total number of bases is: 1849. 
Sequence name: pAP323 

Note: Nucleotides in bold are found within the mutant preproricin linker region. Hie 
within the linker region designate deleted nucleotides. 
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FIGURE 19B fPl> 



Sequence f pAP324 insert 

10 20 30 40 50 

I I I i I 

1 GAATTCATGAAACCGGGAGGAAATACTATTGTAATATGGATGTATGCAGT 
CTTAAGTACTTTGGCCCTCCTTTATGATAACATTATACCTACATACGTCA 

5 1 GGCAACATGGCTTTGTTTTGGATCCACCTCAGGGTGGTCTTTCACATTAG 
CCGTTGTACCGAAACAAAACCTAGGTGGAGTCCCACCAGAAAGTGT7VATC 

101 AGGATAACAACATATTCCCCAAACAATACCCAATTATAAACTTTACCACA 
TCCTATTGTTGTATAAGGGGTTTGTTATGGGTTAATATTTGAAATGGTGT 

151 GCGGGTGCCACTGTGCAAAGCTACACAAACTTTATCAGAGCTGTTCGCGG 
CGCCCACGGTGACACGTTTCGATGTGTTTGAAATAGTCTCGACAAGCGCC 

201 TCGTTTAACAACTGGAGCTGATGTGAGACATGAAATACCAGTGTTGCCAA 
AGCAAATTGTTGACCTCGACTACy^CTCTGTACTTTATGGTCTlCAACGGTO 

251 ACAGAGTTGGTlTGCCTATAAACCAACGGTTTATTTTAGTTGAACTCTCA 
TGTCTCAACCy^AACGGATATTTGGTTGCCT^AATAAAATC^ 

3 01 AATCATGCAGAGCTTTCTGTTACATTAGCGCTGGATGTCACCAATGCATA 
TTAGTACGTCTCGAAAGACAATGTAATCGCX3ACCTACAGTGGTTACGTAT 

351 TGTGGTCGGCTACCGTGCTGGAAATAGCGCATATTTCTTTCATCCTGACA 
ACACCAGCCGATGGCACGACCTTTATCGCGTATAT^GAAAGTAGGACTGT 

401 ATCAGGAAGATGCAGAAGCAATCACTCATCTTTTCACTGATGTTCAAAAT 
TAGTCCTTCTACGTCTTCGTTAGTGAGTAGAAAAGTGACTACAAGTTTTA 

451 CGATATACATTCGCCTTTGGTGGTAATTATGATAGACTTGAACAACTTGC 
GCTATATGTAAGCGGAAACCACCATTAATACTATCTGAACTTGTTGAACG 

501 TGGTAATCTGAGAGAAAAT ATCGAGTTGGGAAATGGTCCACTAGAGGAGG 
ACCATTAGACTCTCTTTTATAGCTCAACCCTTTACCAGGTGATCTCCTCC 

551 CTATCTCAGCGCTTTATTATTACAGTACTGGTGGCACTCAGCTTCCT^CT 
GATAGAGTCGCGAAATAATAATGTCATGACCACCGTGAGTCGAAGGTTGA 

601 CTGGCTCGTTCCTTTATAATTTGCATCCAAATGATTTCAGAAGCAGCAAG 
GACCGAGCAAGGAAATATTAAACGTAGGTTTACTAAAGTCTTCGTCGTTC 

651 ATTCCAATATATTGAGGGAGAAATGCGCACGAGAATTAGGTAGAACCGGA 
TAAGGTTATATAAGTCCCrCTTTACGCGTGCTCTTAATCCATGTTGGCCT 



FIGURE 19B (F2) 

701 GATCTGCACCAGATCCTAGCGTAATTACACTTGAGAATAGTTGGGGGAGA 
CTAGACGTGGTCTAGGATCGCATTAATGTGAACTCTTATCAACCCCCTCT 

751 CTTTCCACTGCAATTCAAGAGTCTAACCAAGGAGCCTTTGCTAGTCCAAT 
GAAAGGTGACGTTAAGTTCTCAGATTGGTTCCTCGGAAACGATCAGGTTA 

801 TCAACTGCAAAGACGTAATGGTTCCAAATTCAGTGTGTACGATGTGAGTA 
AGTTGACGTTTCTGCATTACCAAGGTTTAAGTCACACATGCTACACTCAT 

851 TATTAATCCCTATCATAGCTCTCATGGTGTATAGATGCGGAGGCGGGACT 
ATAATTAGGGATAGTATCGAGAGTACCACATATCTACGCCTCCGCCCTGA 

901 CCA 6GTCCGCAAGGAATT6CA6GGCAG 6GTAGTAGCGG 

6GT CCAOGCGTTCCTTAACGTCCCQTC CCATCATCGCC 

951 CGGGGGATGTATGGATCCTGAGCCCATAGTGCGTATCGTAGGTCGAAATG 
GCCCCCTACATACCTAGGACTCGGGTATCACGCATAGCATCCAGCTTTAC 

1001 GTCTATGTGTTGATGTTAGGGATGGAAGATTCCACAACGGAAACGCAATA 
CAGATACACAACTACAATCCCTACCTTCTAAGGTGTTGCCTTTGCGTTAT 

1051 CAGTTGTGGCCATGCAAGTCTAATACAGATGCAAATCAGCTCTGGACTTT 
GTCAACACCGGTACGTTCAGATTATGTCTACGTTTAGTCGAGACCTGAAA 

1101 GAAAAGAGACAATACTATTCGATCTAATGGAAAGTGTTTAACTACTTACG 
CTTTTCTCTGTTATGATAAGCTAGATTACCTTTCACAAATTGATGAATGC 

1151 GGTACAGTCCGGGAGTCTATGTGATGATCTATGATTGCAATACTGCTGCA 
CCATGTCAGGCCCTCAGATACACTACTAGATACTAACGTTATGACGACGT 

1201 ACTGATGCCACCCGCTGGCAAATATGGGATT^TGGAACCATCATAAATCC 
TGACTACGGTGGGCGACCGTTTATACCCTATTACCTTGGTAGTATTTAGG 

1251 CAGATCTAGTCTAGTTTTAGCAGCGACATCAGGGAACAGTGGTACCACAC 
GTCTAGATCAGATCAAAATCGTCGCTGTAGTCCCTTGTCACCATGGTGTG 

13 01 TTACAGTGCAAACCAACATTTATGCCGTTAGTCAAGGTTGGCTTCCTACT 
AATGTCACGTTTGGTTGTAAATACGGCAATCAGTTCCT^CCGAAGGATGA 

1351 AATAATACACAACCTTTTGTTACAACCATTGTTGGGCTATATGGTCTGTG 
TTATTATGTGTTGGAAAACAATGTTGGTAACAACCCGATATACCAGACAC 

1401 CTTGCAAGCAAATAGTGGACAAGTATGGATAGAGGACTGTAGCAGTGAAA 
GAACGTTCGTTTATCACCTGTTCATACCTATCTCCTGACATCGTCACTTT 



FIGURE 1911 ( P^) 

1451 AGGCTGAACAAOUSTCXXSCTCm-ATGCAGATCMTTCAATACGTCCTCA^ 
TCCGACTTGTTGTCAOCCGAGAAATACGTCTACCAAOT?!?^^^ 

SS^S^^^^^'^^'TTGCCTTACAAGTGATTCTAATATACGGGAAACAGT 
GTTTTGGCTCTATTAACGQAATGTTCACTAAGATTATATGCCCTT^ 

1551 TGTTAAGATCCTCTCTTGTGGCCCTGCATCCTCTGGCCAACGATGGATGT 

acaattctaggagagaacaccgggacgtaggagaccS??SctIot^ 

rS^^^^^^T^^CCy^TTTTAAATTTCTATAGTGGGTTGGTGTTAG^^ 

agttcttactaccttggtaaaatttaaacatatcaccctuvcScStct^ 

1651 gtgaggcgatcggatccgagccttaaacaaatcattctttaccctctcca 
cactccgctagcctaggctcggaatttctttagtaagaaatggSgI^ 

1701 TGGTGACCCAAACCAAATATGGTTACCATTATTTTaATAGACAGATTACT 

accactgggtttggtoataccaatggtaataaaactatSgtct^^ 

^""^^ JT2n''''^°^°^°'^^^^^*^^°^^TGAAAATAGATGGCTTAAATAAAAA 

gagaacgtcacacacacaggacggtacttttatctaccgaatita 

1801 GGACATTGTAAATTTTGTAACTGAAAGGACAGCAAGTTATATCGAATTCn 
CCTGTAACATITAAAACATTGACTrrCCTGTaSTTcSTlTlS^ 

1851 TGCAG 
AGGTC 



Total number of bases is: 1843. 



Sequence name: pAP324 



Note: Nucleotides in bold are foimd within them ^h^ , 

within the linker region designate deleted nucleotides. ^^^&on. i tie symbol 
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FIGURE 20B (PI) 

Sequence of pAP32S insert 

10 20 30 40 50 

I I I I I 

1 GAATTCATGAAACCGGGAGGAAATACTATTGTAATATGGATGTATGCAGT 
CTTAAGTACTTTGGCCCTCCTTTATGATAACATTATACCTACATACGTCA 

5 1 GGCAACATGGCTTTGTTTTGGATCCACCTCAGGGTGGTCTTTCACATTAG 
CCGTTGTACCGAAACAAAACCTAGGTGGAGTCCCACCAGAAAGTGTAATC 

101 AGGATAACAACATATTCCCCAAACT^TACCCAATTATAAACTTTACCA 

TCCTATTGTTGTATAAGGGGTTTGTTATGGGTTAATATTTGAAATGGTGT 

151 GCGGGTGCCACTGTGCAAAGCTACACAAACTTTATCAGAGCTGTTCGCGG 
CGCCCACGGTGACACGTTTCGATGTGTTTGAAATAGTCTCGACAAGCGCC 

201 TCGTTTAACAACTGGAGCTGATGTGAGACATGAAATACCAGTGTTGCCAA 
AGCAAATTGTTGACCTCGACTACACTCTGTACTTTATGGTCACAACGGTT 

251 ACAGAGTTGGTTTGCCTATAAACCAACGGTTTATTTTAGTTGAACTCTCA 
TGTCTCAACCAAACGGATATTTGGTTGCCAAATAAAATCAACTTGAGAGT 

301 AATCATGCAGAGCTTTCTGTTACATTAGCGCTGGATGTCACCAATGCATA 
TTAGTACGTCTCGAAAGACAATGTAATCGCGACCTACAGTGGTTACGTAT 

351 TGTGGTCGGCTACCGTGCTGGAAATAGCGCATATTTCTTTCATCCTGACA 
ACACCAGCCGATGGCACGACCTTTATCGCGTATAAAGAAAGTAGGACTGT 

401 ATCAGGAAGATGCAGAAGCAATCACTCATCTTTTCACTGATGTTCAA/^ 
TAGTCCTTCTACGTCTTCGTTAGTGAGTAGAAAAGTGACTACAAGTTTTA 

451 CGATATACATTCGCCTTTGGTGGTAATTATGATAGACTTGAACAACTTGC 
GCTATATGTAAGCX3GAAACCACCATTAATACTATCTGAACTTGTTGAACG 

501 TGGTAATCTGAGAGAAAATATCGAGTTGGGAAATGGTCCACTAGAGGAGG 
ACCATTAGACTCTCTTTTATAGCTCAACCCTTTACCAGGTGATCTCCTCC 

551 CTATCTCAGCGCTTTATTATTACAGTACTGGTGGCACrCAGCTTCCAACT 
GATAGAGTCGCGAAATAATAATGTCATGACCACCGTGAGTCGAAGGTTGA 

601 CTGGCTCGTTCCTTTATAATTTGCATCCAAATGATTTCAGAAGCAGCAAG 
GACCGAGCAAGGAT^TATTAAACGTAGGTTTACTAAAGTCTTCGTCGTTC 

651 ATTCCAATATATTGAGGGAGAAATGCGCACGAGAATTAGGTACAACCGGA 
TAAGGTTATATAACTCCCTCTTTACGCGTGCTCTTAATCCATGTTGGCCT 



FIGURE 20B (P2^ 

701 GATCTGCACCAGATCCTAGCGTAATTACACTTGAGAATAGTTGGGGGAGA 
CTAGACGTGGTCTAGGATCGCATTAATGTGAACTCTTATCAACCCCCTCT 

751 CTTTCCACTGCAATTCAAGAGTCTAACa^GGAGCCTTTGCTAGTCCAAT 
GAAAGGTGACGTTAAGTTCTCAGATTGGTTCCTCGGAAACGATCAGGTTA 

801 TCAACTGCAAAGACGTAATGGTTCCAAATTCAGTGTGTACGATGTGAGTA 
AGTTGACGTTTCTGCATTACCAAGGTTTAAGTCACACATGCTACACT 

851 TATTAATCCCTATCATAGCTCTCATGGTGTATAGATGCGGAGGCGGGACT 
ATAATTAGGGATAGTATCGAGAGTACCACATATCTACGCCTCCGCCCTGA 

901 CCA CCGCAAGGAATTGCAGGGCAG A6TAGCGG 

GGT GGCGTTCCTTAACGTCCCGTC TCATC6CC 

551 CGGGGGATGTATGGATCCTGAGCCCATAGTGCGTATCGTAGGTCGAAATG 
GCCCCCTACATACCTAGGACTCGGGTATCACGCATAGCATCCAGCTTTAC 

1001 GTCTATGTGTTGATGTTAGGGATGGAAGATTCCACAACGGAAACGCAATA 
CAGATACACAACTACAATCCCTACCTTCTAAGGTGTTGCCTTTGCGTTAT 

1051 CAGTTGTGGCCATGCAAGTCTAATACAGATGCAAATCAGCTCTGGACTTT 
GTCAACACCGGTACGTTCAGATTATGTCTACGTTTAGTCGAGACCTGAAA 

1101 GAAAAGAGACAATACTATTCGATCTAATGGAAAGTGTTTAACTACTTACG 
CTTTTCTCTGTTATGATAAGCTAGATTACCTTTCACAAATTGATGAATGC 

1151 GGTACAGTCCGGGAGTCTATGTGATGATCTATGATTGCAATACTGCTGCA 
CCATGTCAGGCCCTCAGATACACTACTAGATACTAACGTTATGACGACGT 

1201 ACTGATGCCACCCGCTGGCAT^TATGGGATAATGGAACCATCATATU^TCC 
TGACTACGGTGGGCGACCGTTTATACCCTATTACCTTGGTAGTATTTAGG 

1251 CAGATCTAGTCTAGTTTTAGCAGCGACATCAGGGAACAGTGGTACCACAC 
GTCTAGATCAGATCAAAATCGTCGCTGTAGTCCCTTGTCACCATGGTGTG 

1301 TTACAGTGCAAACCAACATTTATGCCGTTAGTCAAGGTTGGCTTCCTACT 
AATGTCACGTTTGGTTGTAAATACGGCAATCAGTTCCAACCGAAGGATGA 

1351 AATAATACACAACCTTTTGTTACAACCATTGTTGGGCTATATGGTCTGTG 
TTATTATGTGTTGGAAAACAATGTTGGTAACAACCCGATATACCAGACAC 

14 01 CTTGCAAGCAAATAGTGGACAAGTATGGATAGAGGACTGTAGCAGTGAAA 
GAACGTTCGTTTATCACCTGTTCATACCTATCTCCTGACATCGTCACTTT 
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14 51 AGGCTGAACT^aiGTGGGCTCTTTATGCAGATGGTTCAATACGTCCTCAG 
TCCGACTTGTTGTCACCCGAGT^TACGTCTACCAAGTTATGCAGGAGTC 

1501 CAAAACCGAGATAATTGCCTTAC7AGTGATTCTAATATACGGGAAACAGT 
GTTTTGGCTCTATTAACGG7ATGTTCACTAAGATTATATGCCCTTTGTCA 

1551 TGTTAAGATCCTCTCTTGTGGCCCTGCATCCTCTGGCCAACGATGGATGT 
ACAATTCTAGGAGAGAACACCGGGACGTAGGAGACCGGTTGCTACCTACA 

1601 TCAAGAATGATGGAACCATTTTAAATTTGTATAGTGGGTTGGTGTTAGAT 
AGTTCTTACTACCTTGGTAAAATTTAAACATATCACCCAACCACAATCTA 

1651 GTGAGGCGATCGGATCCGAGCCTTAAACAAATCATTCTTTACCCTCTCCA 
CACTCCGCTAGCCTAGGCTCGGAATTTGTTTAGTAAGAAATGGGAGAGGT 

1701 TGGTGACCCAAACCAAATATGGTTACCATTATTTTGATAGACAGATTACT 
ACCACTGGGTTTGGTTTATACCAATGGTAATAAAACTATCTGTCTAATGA 

1751 CTCTTGCAGTGTGTGTGTCCTGCCATGAAAATAGATGGCTTAAATAAAAA 
GAGAACGTOVCACACACAGGACGGTACTTTTATCTACCGAATTTATTTTT 

1801 GGACATTGTAAATTTTGTAACTGAAAGGACAGCAAGTTATATCGAATTCC 
CCTGTAACATTTAAAACATTGACTTTCCTGTCGTTCT^TATAGCTTAAGG 

1851 TGCAG 
ACGTC 



Total number of bases is: 1837. 
Sequence name: pAP325 

Note: Nucleotides in bold are found within the mutant preproricin linker region. The symbol 
within the linker region designate deleted nucleotides. 
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